WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Intieniationa) Bureau 




PCX 

INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classificafion 7 ; 

C12Q 1/68, C07H 21/04, A61K 48/00, 

C12N 15/00, isns 



Al 



(11) Internatioiial Publlcatioii Number: 
(43) bitmatkMiai Publication Date: 



WO 00/18959 

6 April 2000 (P6X)400) 



(21) International Application Number: PCTAJS99/216S4 

(22) International FWng Date: 17 Sqttcmber 1999 (\7J09S9) 



(30) Priority Data: 
09/156,191 



(63) Related by 
(OP) to 
US 

Hied on 



17 September 1998 (17X19.98) US 



(CON) or Continuation-in-Fart 
Application 

09/156,191 (OP) 
17 Sqytember 1998 (17.09^) 



(71) Applicant (Jbr aU desigmted SiaSes except US): THE 

IKUSTEBS OP COLUMBIA UNIVERSnY IN THE 
CrrV OF NEW YORK [USOJS]; West 116th Street and 
Broadway, New York, NY 10027 (US). 

(72) Inventor; and 

(75) Inventoi/iM^pUcant (for US only)i SATO, Taka-Ald [US/US]; 
Apartment 8P. 1275 15th Street. Fort Lee, NJ 07024 (US). 

(74)Agait: WHTIB, John, Cooper & Dunham LLP, 1185 
Avenue of the Americas. New Yoric, NY 10036 (US). 



(SI) Dedgnated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY. CA. CH, CN, CR. CU. CZ. DE. DK, EE. ES, PI. 
GB, GO. GE, GH, GM. HR. HU. ID. IL, IN. IS. JP. KB. 
KG, KP. KR. K2, LC. LK, LR. LS, LT, LU. LV, MD. MG. 
MK, MN. MW, MX, NO. NZ. PL, PT, RO, RU. SD, SB, 
SG, SI. SK, SL. TJ, TM, TO, TT. TZ. UA, UG, US. UZ. 
VN, YU, ZA, ZW, ARIPO patent (GH, GM. KE. LS. MW. 
SD. SL, SZ, IZ, UG, ZW), Eurasian patent (AM. AZ, BY, 
KG, KZ. MD, RU, 17, TM), Eunqiean patent (AT, BE, CH. 
CY, DE, DK, ES, PI, PR, GB. GR, IB, FT, LU, MC, NL, 
FT, SE), OAPI patent (BP, BJ, CP. CO, a. CM. GA, (3^, 
GW. ML, MR, NE, SN. TD. TO). 



PubBdied 

With international search report 
Before the expiration of the time limit Jbr 
claims and to be repubiished in the event- if the 
amendments. 



the 
of 



(54) Title: TREX, A NOVEL GENE OF TRAF-INTERACmNG EXT GENE FAMILY AND DIAGNOSTIC AND THERAPEUHC 
USES THEREOF 

(57) Abstract 

This invention provides an isolated nucleic acid molecule encoding a Himor necrosis factor Receptor-Associated Factor (TRAP) 
protein-interacting hereditary multiple extoses (TREX) protein. This invention also provides vectors comprising the isolated nucleic acid 
molecule encoding a TREX protein. This invention fuitiier provides a purified TOEX protein and antibodies diereto. This invention 
provides oligonucleotides comprising a nucleic acid molecule of at least 15 nucleotides c^>able of specifically hybridizing widi a unique 
sequence included wiUiin the sequence of an isolated nucleic acid molecule encoding TREX protein. This invention provides an antisense 
oligonucleotide ovnprising a sequence capable of specifically hytmdizing with a unique sequence included widiin a genomic DNA molecule 
encoding a Tumor necrosis factra- Rec^tor-Associated Factor (IRAF) protem-int»acting hereditary multiple extoses (IREX) protein. Tltis 
invention provides a monoclonal antibody directed to an epitope of a TREX protein. This invention provides medKxIs of inhibiting TREX 
protein interaction widi a TRAP protem; of inhibiting overexprcsslon of TREX protein; of inhibiting growdi of a tumor; of treating 
abnormalities in a subject associated with overexpression of TREX. This invention provides pharmaceutical compositions ccmiprising 
oligonucleotides effective to prevent overexpression of a IREX protein or antibodies effective to block binding of a TREX protein to a 
TRAP protein; screening for compounds which inhibit IREX protein and TRAP protein binding; of detecting predispositions to cancers 
comprising TOEX mutations; and of diagnosing cancer comprismg TREX mutations. 
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TRBX. A NOVEL GENE OP TRAP -INTERACTING EXT GENE FAMIIiY AMD 
DIAGNOSTIC AMD THERAPEUTIC USES THEREOF 

.This application claims priority and is a continuation- in- 
10 part application of U.S. Serial No. 09/156,191, filed 
September 17, 1998, the contents of which is hereby 
incorporated by reference. 

STATEMENT REGARDING FEDERALLY FUNDED RESEARCH 

15 

The invention disclosed herein was made in part with 
Government support imder NIH Grant No. R01GM55147. 
Accordingly, the U.S. Government has certain rights in this 
invention. 

20 

Throughout this application, various references are referred 
to within parentheses. Disclosures of these publications in 
their entireties are hereby incorporated by reference into 
this application to more fully describe the state of the art 
25 to which this invention pertains. Full bibliographic 
citation for these references may be found at the end of 
this application, preceding the claims. 

BACKGROUND OF THE INVENTION 

30 

Tumor necrosis factor (TNF) receptor-associated factor 
(TRAF) proteins contribute to signal transduction induced by 
TNF receptor family signaling. TRAF3 cloned as binding 
protein to the cytoplastic domain of CD40, a member of TNF 

35 receptor superfamily, is believed to be involved in 
signaling pathway induced by CD40, Lymphotoxin (LT) S 
receptor, CD30 ligation (1-7) . Here we report molecular 
cloning of a novel TRAF- interacting protein named as TREX 
because of TRAF- interacting EXT (hereditary multiple 

40 exostoses) gene family protein. TREX has highly homologous 
sequence to the EXT gene family, a candidate of tumor 
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suppressor gene. TREX strongly Interacts with TRAF2 and 
TRAF3, and TREX and TRAP protein colocalize in mamnialicui 
cells. Moreover, overexpression o£ TREX modulates NF-kB 
activity induced by TRAF-mediated signaling. These findings 
5 indicate that TREX and the other EXT gene family proteins 
can function as a mediator in receptor signaling and could 
be involved in tumorigenesis.^ 

10 

SUMMARY OF THE INVENTION 

This invention provides cui isolated nucleic acid molecule 
encoding a Tuim^r necrosis factor Receptor-Associated Factor 
15 (TRAF) protein- interacting hereditary multiple extoses 
(TREX) protein. 

This invention provides an isolated nucleic acid molecule 
encoding a mutant homolog of the mammalian Tumor necrosis 
20 factor Receptor-Associated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) protein whose mutant 
sequences (genetic alterations) are shown in Table 3 infra. 

25 This invention provides a vector comprising the isolated 
nucleic acid molecule encoding a Tumor necrosis factor 
Receptor-Associated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) protein. 

30 This invention provides a purified mammalian Tumor necrosis 
factor Receptor-Associated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) protein. 

This invention provides a protein comprising substantially 
35 the amino acid sequence set forth in Figure lA (SEQ ID N0S:2 
and 4) . 
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This invention provides an oligonucleotide coinprising a 
nucleic acid molecule of at least 15 nucleotides capable of 
specifically hybridizing with a unique secjuence included 
within the secjuence of an isolated nucleic acid molecule 
5 encoding a Tumor necrosis factor Receptor -Associated Factor 
(TRAP) protein- interacting hereditary multiple extoses 
(TREX) protein. 

This invention provides an antisense oligonucleotide 
10 comprising a sequence capable of specifically hybridizing 
with a unique secpience included within an mRNA molecule 
encoding a Tumor necrosis factor Receptor-Associated Factor 
(TRAP) protein- interacting hereditary multiple extoses 
(TREX) protein. 

15 

This invention provides an antisense oligonucleotide 
comprising a sequence capable of specifically hybridizing 
with a unique sequence included within a genomic DNA 
molecule encoding a Tumor necrosis factor Receptor- 
20 Associated Factor (TRAP) protein- interacting hereditary 
multiple extoses (TREX) protein. 

This invention provides a monoclonal antibody directed to cui 
epitope of a Tumor necrosis factor Receptor -Associated 
25 Factor (TR7VF) protein- interacting hereditaiy multiple 
extoses (TREX) protein. 

This invention provides a method of inhibiting TREX protein 
interaction with a TRAP protein comprising administering a 
30 ligand comprising an amino acid domain which binds to a EXT 
C domain of the TREX protein so as to inhibit binding of the 
TREX protein to the TRAP protein. 

This invention provides a method of inhibiting 
35 overe^qpression of TREX protein comprising administering any 
of the above -described antisense oligonucleotides which bind 
to an mRNA molecule encoding a human Tumor necrosis factor 



wo 00/18959 PCTAJS99/21<i54 

-4- 

Receptor-Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein so as to inhibit 
overexpression of the human TREX protein. 

5 This invention provides a method of inhibiting growth of a 
tumor cell comprising blocking a TRAP interacting site of a 
TREX protein by administering a ligand capable of binding to 
the TRAP interacting site of a TREX protein. 

10 This invention provides a pharmaceutical composition 
comprising an amount of any of the above -described 
oligonucleotides effective to prevent overexpression of a 
TREX protein emd a pharmaceutically acceptable carrier 
capable of passing through a cell membrane. 

15 

This invention provides a pharmaceutical composition 
comprising an amount of any of the above -described 
antibodies effective to block binding of a TREX protein to 
a TRAP protein and a pharmaceutically acceptable carrier 
20 capable of passing through a cell membrane. 

This invention provides a method of treating an abnormality 
in a subject, wherein the cdbnormality is alleviated by the 
inhibition of binding of a TREX protein and a TRAP protein 
25 which comprises administering to the subject an effective 
amoiint of the above described pharmaceutical conqposition 
effective to block binding of the TREX protein and the TRAP 
protein in the siibject, thereby treating the abnormality in 
the subject. 

30 

This invention provides a method of treating an abnormality 
in a subject, wherein the abnormality is alleviated by the 
inhibition of overexpression of a TREX protein which 
conprises administering to the subject an effective amoxint 
35 of the above -described pharmaceutical composition effective 
to inhibit overexpression of the TREX protein, thereby 
treating the abnormality in the s\abject. In a preferred 
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embodiment the abnormality is cancer, a hereditary noiltiple 
extosis or an autoimmune disease. 

This invention provides a method of screening for a chemical 
compound which inhibits TREX protein and TRAP protein 
binding conprising: (a) incubating the chemical compound 
with a TREX protein and a TRAP protein; (b) contacting the 
incubate of step (a) with an affinity medium under 
conditions so as to bind a TREX protein-TRAF protein 
complex, if such a complex forms; cuid (c) measuring the 
amount of the TREX protein-TRAF protein conplex formed in 
step (b) so as to determine whether the compound is capable 
of interfering with the formation of the conqplex between the 
TREX protein-TRAF protein. 

This invention provides a method of preventing inhibition of 
a CD40 signal -dependent NF-kB activation comprising 
administering any of the above -described antisense 
oligonucleotides which bind to an mRNA molecule encoding a 
human Tumor necrosis factor Receptor-Associated Factor 
(TRAF) protein- interacting hereditary multiple extoses 
(TREX) protein so as to prevent inhibition of CD40 signal- 
dependent NF-kB activation. 

This invention provides a method of preventing inhibition of 
activation of a CD40 signal -dependent NF-kB comprising 
administering a ligand comprising an amino acid domain which 
binds to a EXT C domain of the TREX protein so as to inhibit 
binding of the TREX protein to the TRAF protein, thereby 
preventing inhibition of activation of a CD40 signal- 
dependent NF-kB. 

This invention provides a method of preventing upregulation 
of a TNF receptor typell signal -dependent NF-kB activation 
comprising administering any of the above -described 
antisense oligonucleotides which bind to an mRNA molecule 
encoding a hiomaui Tumor necrosis factor Receptor -Associated 
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Factor (TRAP) protein- interacting hereditary multiple 
extoses (TREX) protein so as to prevent upregulation of a 
TNF receptor typell (TNFRII) signal -dependent NF-kB 
activation. 

This invention provides a method of preventing upregulation 
of activation of a TNF receptor typell (TNFRII) -signal - 
dependent NF-kB comprising administering a ligamd cortprising 
an amino acid domain which binds to a EXT C domain of the 
TREX protein so as to inhibit binding of the TREX protein to 
the TRAP protein, thereby preventing upregulation of 
activation of a TNF receptor typell -signal -dependent NF-kB. 

This invention provides a method of detecting a 
predisposition to cancer which comprises detecting of a 
mutation in a nucleic acid encoding TREX protein in the 
sample from the subject. 

This invention provides a TREX nucleic acid probe comprising 
a sequence capable of specifically hybridizing with a unique 
sequence included within the above -described isolated DNA 
molecule encoding a Tumor necrosis factor Receptor- 
Associated Factor (TRT^) protein- interacting hereditary 
multiple extoses (TREX) protein. 

This invention provides a method of diagnosing cancer in a 
subject which comprises: a) obtaining DNA from the sample of 
a subject suffering from cancer; b) performing a restriction 
digest of the DNA with a panel of restriction enzymes; c) 
separating the resulting DNA fragments by size 
fractionation; d) contacting the resulting DNA fragments 
with a nucleic acid probe capable of specifically 
hybridizing with a unique sequence included within the 
sequence of a genetic alteration of a nucleic acid molecule 
encoding a TREX protein, wherein the nucleic acid is labeled 
with a detectable marker; e) detecting labeled bands which 
have hybridized to the nucleic acid probe in step (d) , 
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wherein the sequence of a genetic alteration of a nucleic 
acid molecule encoding a TREX protein creates a unique band 
pattern specific to the DNA of subjects suffering from 
cancer; f ) preparing DNA obtained from a sanple of a subject 
5 for diagnosis by steps (a-e) ; and g) comparing the detected 
band pattern specific to the DNA obtained from a sample of 
sxibjects suffering from cancer from step (e) and the DNA 
obtained from a saitple of the subject for diagnosis from 
step (f) to determine whether the patterns are the same or 
10 different and to diagnose thereby predisposition to cancer 
if the patterns are the same. 

This invention provides a method of diagnosing cancer in a 
subject which comprises: a) obtaining RNA from the sample of 

15 the subject suffering from cancer; b) separating the RNA 
sample by size fractionation; c) contacting the resulting 
RNA species with a nucleic acid probe capable of 
specifically hybridizing with a unique sequence included 
within the sequence of a nucleic acid molecule encoding a 

20 mutated TREX protein, wherein the seqpience of the nucleic 
acid molecule encoding the mutated TREX protein is IcUbeled 
with a detectable marker; d) detecting labeled bcuids which 
have hybridized to the RNA species to create a unique band 
pattern specific to the RNA of subjects suffering from 

25 cancer; e) preparing RNA obtained from a sample of a subject 
for diagnosis by steps (a-d) ; and f ) comparing the detected 
band pattern specific to the RNA obtained from a sample of 
subjects suffering from cancer from step (d) and the RNA 
obtained from a sample of the subject for diagnosis from 

30 step (f) to determine whether the patterns are the same or 
different and to diagnose thereby predisposition to cancer 
if the patterns are the same. 

35 
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BRIEF DESCRIPTION OF THE FIGURES 

5 Figures lA-lP. Amino acid sequences of TREX and expression 
of TREX. Fig. lA^ Predicted amino acid sequences of mouse 
and human TREX. Identical residues are boxed. Partial clones 
obtained by two-hybrid screening are indicated by brackets. 
Isoleucine and leucine residues that form putative 

10 isoleucine zipper motif are boxed and darkly shaded. Fig. 
IB, Schematic representation of putative domain structure of 
EXT gene family proteins. Conserved domains are indicated as 
EXT-N and EXT-C domain. Fig. IC, Sequence alignments of 
EXT-N domain. Conserved residues are shaded. Fig. ID, 

15 Sequence alignments of EXT-C domain. Conserved residues are 
shaded. Fig. IE, Northern blot analysis of TREX mRNA. 
Multiple tissue northern blot (Clontech) were probed with 
human or mouse TREX cDNA. Fig. IF, Expression of TREX 
protein in human cells. Cell lysates of KM12L4 cell line 

20 were immunoprecipitated with either rabbit preimmune IgG or 
rabbit anti-TREX antibody. TREX proteins were detected with 
cuiti-TREX antibody (107 kDa) . 

Figure 2A-B. Intracellular association of TREX and TRT^ 
25 family proteins. Fig. 2A, 293 T cells were transiently 
trcinsfected with myc- tagged TREX together with FLAG- tagged 
TRAFs- Cell lysates were immunoprecipitated with preimmune 
rabbit IgG (Control) or rabbit anti-rayc antibody (cmryc) . 
Coimrmmoprecipitated TRAF proteins were analyzed by Western 
30 blotting using anti-FLAG antibody. Expression of TRT^ 
proteins was monitored by Western blotting using cell 
lysates (bottom) . Fig. 2B, Colocalization of TREX and TRAF3 
in mammalian cells. C0S7 cells were transfected with 
myc-tagged TREX or TRAF3 . Myc-tagged TREX (R-phycoerythrin, 
35 red) localized around nucleus as similar with TRAF3 (FITC, 
green) . 
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Flgxire 3, TREX modulates NP-kB activity induced by 
TRAF-mediated signaling pathway. 293 cells were transiently 
transfected with NF-RB-dependent reporter gene together with 
several amounts of TREX in the presence of CD40 and CD40 
5 ligand (a) or TRAF2 (b) . Luciferase activities were 
determined cuid normalized by co-transfection of pRL-CMV 
using dual -luciferase assay kit (Promega) - 

Figure 4. TREX upregulates NF-kB activity induced by 
10 TNFof- induced NF-kB activation in hiiman embryonic kidney 293 
cell. 293 human embryo kidney cells were maintained in MEM 
containing 10% FCS, 100 iig/val penicillin G and 100 /ig/ml 
streptomycin. For reporter assay, 10* cells were seeded on 
100 mm dishes and grown for 3 days in 5% COg at 37* C. The 
15 cells were transfected with reporter DNA (luciferase) and 
either empty (pcDNA3 . 1 ( - ) /MYC HIS) or mTREX escpression 
plasmid (pcDNA3 . 1( - ) /MYC HIS-m TREX) by the calcium 
phosphate precipitation method. After 12 h, the cells were 
treated with or without 20 ng/ml TNF-alpha. After an 
. 20 additional incubation for 12 h, the cells were washed with 
PBS and then the luciferase activities were determined by 
using Dual luciferase reporter assay system (Promega) . 

Figure 5. Chromosomal mapping of the TREX gene on 
25 chromosome 8pl2-p21. The biotin-labeled TREX cDNA probe and 
the digoxigenin- labeled chromosome 8 centromere- specific 
probe were cohybridized to a normal humsm metaphase (a) or 
prophase (b) spreads and detected with avidin FITC (green 
signals) and anti-digoxigenin-rhodamine (red signals) , 
30 respectively. Chromosomes were counters tained with DAPI 
(blue) . 

Ficpire 6. Genomic organization of TREX gene. Exon-intron 
distribution is shown in upper panel. The 7 exons are 
35 indicated by box and numbered. The size of intron is also 
indicated in kilobases. The middle panel represents the TREX 
cDNA with translation initiation site (ATG) and termination 
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site (TAG) . Closed box cuid open box in these represent the 
coding region and non- coding region, respectively. 

Figures 7A-7B. Fig. 7A. Mouse TREX cDNA nucleotides 1-3479. 
5 (SEQ ID NO: 1); Mouse TREX cDNA Genbank Accession NO. 
AF083550. Fig. 7B. Mouse TREX cDNA nucleotides and the 
predicted amino acid sequence (SEQ ID NO: 2) , 



Figure 8A-8B. Fig. 8A. Human TREX cDNA nucleotides 1- 

10 6172. (SEQ ID NO: 3); Human TREX cDNA Genbank Accession NO. 
AF083551. Fig. 8B. Human TREX cDNA nucleotides and the 
predicted amino acid sequence (SEQ ID NO: 4} 

Figures 9A-9B* Sequence aligxunen^ o£ mouse and human EXTL3 
15 proteins and eacpresslon o£ mouse EXIL3 and itiRNA in various 
tissues. Fig. 9A. The amino acid sequence of mouse EXTL3 
(AF083550) and hiiman EXTL3 (AF083551) were aligned by using 
GENETYX-MAC 9.0 Identical residues are boxed, and a putative 
isoleucine zipper motif is shaded. Fig. 9B. Expression of 
20 the mouse EXTL3 gene on a commercial Northern blot 
(Clontech) of eight different tissues using a cDNA fragment 
as a probe- The various tissues are labeled at the top, and 
the size markers axe indicated on the left. A transcript of 
about 6kb is present in all tissues. 

25 

Figures IDA- IOC. Enhancement of NF-kB activation stimulated 
by TNF-a In HEK293 cells overeaq^resslng EXTL3. Fig. lOA. 
HEK293 cells were transfected with pcDNA or pcDNA/EXTL3. 
After 12 h, the cells were stimulated with or without 20 

30 ng/ml TNF-of for 1 h. Then, nuclear extracts prepared from 
the cells were analyzed by using a electrophoretic mobility 
shift assay with NF-kB consensus oligonucleotide. Fig. lOB. 
The indicated amotint of pcDNA/EXTL3 was cotransfected with 
500 ng of the luciferase reporter plasmid pELAM-luc and 500 

35 ng pRL-TK into HEK293 cells. The total amount of pcDNA 
constructs was adjusted to 10 fig by addition of en4)ty 
vector. After 12 h, the cells were treated with or without 
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20 ng/ml TNF-a. At 12 h after stimulation, cell lysates 
were prepared and siibjected to a dtial lucif erase assay. All 
values representing luciferase activities were normalized 
and are shown as the mean±SEM of trimplicate samples. Fig. 
5 IOC The indicated amount of pcDNA/EXTL3 and 5 jig of HA- 
tagged human TRAP2 construct were transfected with 500 ng of 
the luciferase reporter plasmid pELAM-luc and 500 ng pRL-TK 
into HEK293 cells. The total amoamt of pcDNA constructs was 
adjusted to 10/xg by adding an empty vector. After 24 h, 
10 cell lysates were prepared and subjected to the dual 
luciferase assay. All values representing luciferase 
activities were normalized and are shown as the mecui±SEM of 
triplicate samples. 

15 Figures llA*-llDa-llDc. Effects of EXTL3 truncation mutants 
on NF-kB activity. Fig. IIA. Schematic representation of 
truncation mutants used in this assay. TM, transmembrane 
region; EXT-C, EXT-CCXDH domain; EXT-N, EXT-NHg domain. Fig. 
IIB. A 10 - ua a liquot of pcDNA/EXTLB, pcDNA/Z^ EXTL3 , 

20 pcDNA/AC EXTL3, or pcDNA/AN&C EXTL3 was transfected with 500 
ng pEIiAM-luc and 500 ng pRL-TK into HEK293 cells. After 12 
h, the cells were treated with (hatched column) or without 
(open column) 20 ng/ml TNF-a. At 12 h after stimulation, 
cell lysates were prepared and subjected to the dual 

25 luciferase assay. All values representing luciferase 
activities were normalized and are shown as the mecUi±SEM of 
six samples. Fig. IIC. A 5/iq_..of pcDNA/EXTL3, pcDNA/AN 
EXTL3, pcDNA/AC EXTL3, or pcDNA/AN&C EXTL3 and 5 jjg HA- 
tagged human TRAF2 construct (hatched column) or empty 

30 vector (open column) were transfected with 500 ng pELAM-luc 
and 500 ng pRL-TK into HEK293 cells. After 24 h, cell 
lysates were prepared and subjected to the dual luciferase 
assay. All values representing luciferase activities were 
normalized and are shown as the mean ±SEM of seven samples. 

35 Fig. IID. HBK293 cells cultured on cover glasses were 
transfected with pEGFP-N2 (a) , pEGFP/EXTL3 (b) , or pEGFP/AN 
EXTL3 (c) . After transfection, the cells were fixed with 
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3,7% formalin. Then, cells were treated with 0.2% Triton X- 
100. Fluorescence was imaged with a confocal laser scanning 
microscope. Bar, 50 jLtm. 

5 Figures 12A-12H* Effects of TRAFs on EXTL3 distribution 

HEK293 cells cultured on cover glasses were transfected with 
EGFP-tagged EXTL3 construct and FLAG-tagged TRAF2 (Figs. 
12A-12D) or TRAF3 (E-H) constructs. After transf ection, the 
cells were fixed with 3.7% formalin. Then, cells were 

10 treated with 0.2% Triton X-100. After blocking, indirect 
imrauno- fluorescence cuialysis was performed. Monoclonal 
anti-FLAG antibody was used as a first antibody followed by 
a Cy- 5 -conjugated second antibody. TRITC-concanavalin A was 
used to reveal the endoplasmic reticulxim region. 

15 Fluorescence was imaged with a confocal laser scanning 
microscope. EXTL3 is shown in green (Figs. 12A, 12E) . The 
concanavalin A-stained region is shown in red (Figs. 12B, 
12F) . Fig. 12C shows TRAF2 in white, and Fig. 12G shows 
TRAF3 in white. Fig. 12D is a merged image of Figs. 12A, 

20 12B, and 12C, and Fig. 12H shows a merged image of Figs. 
12E, 12F, and 12G. Bar, 10 jim. 



wo 00/18959 



P€TAJS99y21654 



-13- 

DETAILED DESCRIPTION OF THE INVENTION 

The following standard abbreviations are used throughout the 
specification to indicate specific nucleotides: 

5 

C=cytosine A^adenosine 
T^thymidine G=guanosine 



This invention provides an isolated nucleic acid molecule 
10 encoding a Tumor necrosis factor Receptor-Associated Factor 
(TRAF) protein- interacting hereditary multiple extoses 
(TREX) protein. 

As used herein, tumor necrosis factor Receptor -Associated 
15 Factor (TRAF) protein-interacting hereditary multiple 
extoses protein (TREX) is a protein first identified as a 
potential tumor suppressor gene involved in tumor necrosis 
factor receptor (TNFR) superfamily. Furthermore, TREX is a 
signal modulator which bridges between TNFR and CD40- 
20 mediated signal transduction « 

In an embodiment the above-described isolated nucleic acid 
molecule is a DNA molecule or a fragment thereof. In another 
embodiment the isolated DNA. molecule is a cDNA molecule. In 

25 a further embodiment the DNA molecule is a genomic DNA 
molecule. In an embodiment the nucleic acid molecule is an 
RNA molecule. In another embodiment the nucleic acid 
molecule encodes a mammalian Tumor necrosis factor Receptor- 
Associated Factor (TRAF) protein- interacting hereditary 

30 multiple extoses (TREX) protein or a functionally active 
fragment thereof, e.g. a motif that interacts with TRAF 
proteins, including but not limited to motifs such as an 
isoleucine zipper motif and an EXT-C domain. In an 
embodiment the encoded mammalian Tumor necrosis factor 

35 Receptor -Associated Factor (TRAF) protein-interacting 
hereditary multiple extoses (TREX) protein is human Tumor 
necrosis factor Receptor -Associated Factor (TRAF) protein- 
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interacting hereditary multiple extoses (TREX) protein. 

The DNA molecules of the subject invention also include DNA 
molecules coding for polypeptide analogs, fragments or 
5 derivatives of antigenic polypeptides which differ from 
naturally-occurring forms in terms of the identity or 
location of one or more amino acid residues (deletion 
analogs containing less than all of the residues specified 
for the protein, substitution analogs wherein one or more 

10 residues specified are replaced by other residues and 
addition analogs where in one or more amino acid residues is 
added to a terminal or medial portion of the polypeptides) 
and which share some or all properties of naturally- 
occurring forms. These molecules include: the incorporation 

15 of codons "preferred" for expression by selected non- 
mammalian hosts; the provision of sites for cleavage by 
restriction endonuclease enzymes; and the provision of 
additional initial, terminal or intermediate DNA sequences 
that facilitate construction of readily expressed vectors. 

20 

The DNA molecules described and claimed herein are useful 
for the information which they provide concerning the amino 
acid sequence of the polypeptide, TREX, and as products for 
the . large scale synthesis of the polypeptide (TREX) or 

25 fragments thereof (e.g. for the production of portions of 
the polypeptide encoding an isoleucine zipper motif, a 
hereditary multiple extoses C (EXT C) domain, or an 
isoleucine zipper motif and a hereditary multiple extoses C 
(EXT C) domain, portions which are involved in protein- 

30 protein interactions) by a variety of recombinant 
techniques. The molecule is useful for generating new 
cloning and expression vectors, transformed and transfected 
prokaryotic and eukaryotic host cells, and new and useful 
methods for cultured growth of such host cells capcQ>le of 

35 expression of the polypeptide (TREX) or portions thereof 
which comprise an isoleucine zipper motif and/or a 
hereditary multiple extoses C (EXT C) domain and related 
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products. 

In an embodiment the isolated nucleic acid molecule encoding 
the mammalian Tumor necrosis factor Receptor-Associated 
5 Factor (TRAP) protein- interacting hereditary multiple 
extoses (TREX) protein is a mouse, rat or human Tumor 
necrosis factor Receptor-Associated Factor (TRAF) protein- 
interacting hereditary multiple extoses (TREX) protein. In 
another embodiment the isolated nucleic acid molecule 

10 encodes a Tumor necrosis factor Receptor-Associated Factor 
(TRAF) protein- interacting hereditary multiple extoses 
(TREX) protein comprising an amino acid sequence as set 
forth in Figures 1 and 7B (SEQ ID NO: 2) • In an embodiment 
the isolated nucleic acid molecule encodes a mouse TREX 

15 protein. In another embodiment the isolated nucleic acid 
molecule encodes a Tumor necrosis factor Receptor-Associated 
Factor (TRAF) protein- interacting hereditary multiple 
extoses (TREX) protein comprising an amino acid sequence as 
set forth in Figures 1 and 8B (SEQ ID N0:4) . In an 

20 embodiment the isolated nucleic acid molecule encodes a 
human TREX protein. 

In an embodiment of the isolated nucleic acid molecule the 
encoded amino acid sequence comprises an isoleucine zipper 

25 motif and a herediteary multiple extoses C (EXT C) domain. 
In an emibodiment the isolated nucleic acid is a fragment of 
the above- described nucleic acid, said fragment encoding an 
isoleucine zipper motif, a hereditary multiple extoses C 
(EXT C) domain, or an isoleucine zipper motif and a 

30 hereditary multiple extoses C (EXT C) domain. In another 
embodiment the Tumor necrosis factor Receptor -Associated 
Factor (TRAF) protein- interacting hereditary multiple 
extoses (TREX) protein has sxabstantially the same amino acid 
sequence as set forth in Figures 1 and 7B (SEQ ID NO: 2) . 

35 In a preferred embodiment the Tumor necrosis factor 
Receptor- Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein has substantially 
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the same amino acid sequence as set forth in Figures 1 and 
8B (SEQ ID NO: 4) . In another embodiment the Tumor necrosis 
factor Receptor-Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein has the amino 
5 acid sequence as set forth in Figure 1 and 7B (SEQ ID NO: 
2) - Iii preferred embodiment the Tumor necrosis factor 
Receptor-Associated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) protein has the amino 
acid sequence as set forth in Figure 1 and 8B (SEQ ID NO: 
10 4) . 

This invention provides cui isolated nucleic acid molecule 
encoding a mutsuit homolog of the mammalian Tumor necrosis 
factor Receptor- Associated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) protein whose genetic 
alterations and resulting amino acid sequence (s) is set 
forth in Table 3, infra. In an embodiment the isolated 
nucleic acid molecule is a deletion mutant. In an embodiment 
of the deletion mutant the encoded mutant homolog comprises 
a tumor suppressor locus. In an embodiment of the deletion 
mutant the encoded mutant homolog does not comprise a tumor 
suppressor loc\as domain. In a further embodiment the above - 
described isolated nucleic acid molecule encoding the 
mammaliskn TREX protein comprises the genetic alterations and 
resulting amino acid sequence (s) as shown in TcdDle 3, 
infra . 

This invention provides a vector comprising the isolated 
nucleic acid molecule encoding a Tumor necrosis factor 
30 Receptor-Associated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) protein. In an embodiment 
the vector is adapted for expression in a host cell which 
comprises the regulatory elements necessary for expression 
of the nucleic acid molecule in the host cell operatively 
35 linked to the nucleic acid molecule encoding the Tumor 
necrosis factor Receptor-Associated Factor (TRAF) protein- 
interacting hereditary multiple extoses (TREX) protein so 



15 



20 



25 
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as to permit expression of the TREX protein. In another 
embodiment of the vector the host cell is a eukaryotic, 
bacterial; insect or yeast cell. In an embodiment of the 
vector the eukaryotic host cell is a mammalian cell. In a 
5 further embodiment the vector is a plasmid. In another 
embodiment of the vector comprising the nucleic acid 
encoding a Tumor necrosis factor Receptor -Associated Factor 
(TRAP) protein- interacting hereditary multiple extoses 
(TREX) protein the nucleic acid molecule is a DNA molecule. 

10 In an embodiment the DNA molecule is a cDNA molecule. In 
further embodiments, any of the above-described vectors are 
adapted for e^qpression in a host cell which comprises the 
regulatory elements necessary for e3q>ression of the nucleic 
acid molecule in the host cell operatively linked to the 

15 nucleic acid molecule encoding the Tumor necrosis factor 
Receptor -Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein as to permit 
expression of the TREX protein. In an embodiment of the 
vector, the host cell is a eukaryotic, bacterial, insect or 

20 yeast cell. In another embodiment of the vector, the 
eukaryotic host cell is a mammalian cell. In a further 
embodiment of the vector is a plasmid. 

Numerous vectors for expressing the inventive proteins may 
25 be employed. Such vectors, including plasmid vectors, 
cosmid vectors, bacteriophage vectors and other viruses, are 
well known in the art. For example, one class of vectors 
utilizes DNA elements which are derived from animal viruses 
such as bovine papilloma virus, polyoma virus, adenovirus, 
30 vaccinia virus, baculovirus, retroviruses (RSV, MMTV or 
MoMLV) , Semliki Forest virus or SV40 virus. Additionally, 
cells which have stably integrated the DNA into their 
chromosomes may be selected by introducing one or more 
markers which allow for the selection of transfected host 
35 cells. The markers may provide, for example, prototrophy to 
an auxotrophic host, biocide resistance or resistance to 
heavy metals such as copper. The selectcQ^le marker gene can 
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be either directly linked to the DNA sequences to be 
expressed, or introduced into the same cell by 
cot ransf ormat ion . 

5 Regulatory elements required for expression include promoter 
sequences to bind RNA polymerase and transcription 
initiation sequences for ribosome binding. Additional 
elements may also be needed for optimal synthesis of mRNA. 
These additional elements may include splice signals, as 

10 well as enhancers and termination signals. For example, a 
bacterial expression vector includes a promoter such as the 
lac promoter and for treinscription initiation the Shine- 
Dalgamo sequence and the start codon AUG. Similarly, a 
eukaryotic expression vector includes a heterologovis or 

15 homologous promoter for RNA polymerase II, a downstream 
polyadenylation signal, the start codon AUG, and a 
termination codon for detachment of the ribosome. Such 
vectors may be obtained commercially or assembled from the 
sequences described by methods well known in the art, for 

20 example the methods described above for constrxicting vectors 
in general. 

These vectors may be introduced into a suitable host cell to 
form a host vector system for producing the inventive 
25 proteins. Methods of making host vector systems are well 
known to those skilled in the art. 

Suitable host cells include, but are not limited to, 
bacterial cells (including gram positive cells) , yeast 

30 cells, fungal cells, insect cells and animal cells. 
Suitable animal cells include, but are not limited to HeLa 
cells, Cos cells, CVl cells and various primary mammalian 
cells. Numerous mammalicui cells may be used as hosts, 
including, but not limited to, the mouse fibroblast cell 

35 NIH-3T3 cells, CHO cells, HeLa cells, Ltk" cells and COS 
cells. Mammalian cells may be transfected by methods well 
known in the art such as calcium phosphate precipitation. 
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electroporation and microinjection. 

One of ordinary skill in the art will easily obtain xrnique 
sequences from the cDNA cloned in plasmids. Such unique 
5 sequences may be used as probes to screen various mammalian 
cDNA libraries and genomic DNAs, e.g. mouse, rat and bovine, 
to obtain homologous nucleic acid sequences and to screen 
different cDNA tissue libraries to obtain isoforms of the 
obtained nucleic acid sequences. Nucleic acid probes from 

10 the cDNA cloned in plasmids may further be used to screen 
other human tissue cDNA libraries to obtain isoforms of the 
nucleic acid sequences encoding TRKX as well as to screen 
human genomic DNA to obtain the sunalogous nucleic acid 
sequences. The homologous nucleic acid sequences and 

15 isoforms may be used to produce the proteins encoded 
thereby. 

This invention provides a method of producing a host cell 
operatively linked to the nucleic acid molecule encoding a 

20 Tumor necrosis factor Receptor -Associated Factor (TRAF) 
protein- interacting hereditary multiple extoses (TREX) 
protein, which comprises growing a host cell comprising any 
of the above -described vectors lander suitable conditions 
peannitting production of the TREX protein and recovering the 

25 TREX protein so produced. In an embodiment the method 
further comprising purifying the recovered TREX protein. 

This invention provides a method of producing a polypeptide 
having the biological activity of a protein encoded by the 

30 nucleic acid molecule encoding a Tumor necrosis factor 
Receptor-Associated Factor (TRAF) protein-interacting 
hereditary multiple extoses (TREX) protein which comprises 
growing any of the above -described host cells under suitable 
conditions permitting production of the polypeptide and 

35 recovering the polypeptide so produced. In an embodiment 
the method further comprises purifying the recovered 
polypeptide. 
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This invention provides a purified mammalian Tumor necrosis 
factor Receptor-Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein. In an embodiment 
the purified mammalian Tumor necrosis factor Receptor- 
5 Associated Factor (TRAF) protein- interacting hereditary- 
multiple extoses (TREX) protein is a human TREX protein . 

This invention provides a protein comprising substantially 
the amino acid sequence set forth in Figure 1, 

10 

This invention provides an oligonucleotide comprising a 
nucleic acid molecule of at least 15 nucleotides capcd>le of 
specifically hybridizing with a unique sequence included 
within the sequence of an isolated nucleic acid molecule 

15 encoding a Tumor necrosis factor Receptor -Associated Factor 
(TRAF) protein- interacting hereditary multiple extoses 
(TREX) protein. In an embodiment of the oligonucleotide the 
nucleic acid is DNA. In another embodiment of the 
oligonucleotide, the nucleic acid is RNA. In an embodiment 

20 the oligonucleotide comprises a nucleic acid molecule of at 
least IS contiguous nucleotides capable of specifically 
hybridizing with a unique sequence included within the 
sequence of an isolated nucleic acid molecule encoding a 
Tumor necrosis factor Receptor-Associated Factor (TRAF) 

25 protein- interacting hereditary multiple extoses (TREX) 
protein . 

This invention provides an antisense oligonucleotide 
comprising a sequence capable of specifically hybridizing 
30 with a luiique sequence included within an mRNA molecule 
encoding a Tumor necrosis factor Receptor-Associated Factor 
(TRAF) protein- interacting hereditary multiple extoses 
(TREX) protein. 

35 This invention provides an antisense oligonucleotide 
comprising a sequence capable of specifically hybridizing 
with a unique sequence included within a genomic DNA 
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molecule encoding a Tumor necrosis factor Receptor- 
Associated Factor (TRAP) protein-interacting hereditary 
multiple extoses (TREX) protein. 

5 This invention provides an antibody capable of binding to 
any of the above -described mammalian Tumor necrosis factor 
Receptor -T^sociated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) proteins. In an 
embodiment the antibody is a monoclonal antibody. In 
10 another embodiment the antibody is a polyclonal antibody. 

This invention provides a monoclonal antibody directed to an 
epitope of a Tumor necrosis factor Receptor -Associated 
Factor (TRAF) protein- interacting hereditary multiple 
15 extoses (TREX) protein. 

Polyclonal antibodies may be produced by injecting a host 
animal such as rabbit, rat, goat, mouse or other animal with 
the immunogen of this invention, e.g. a purified mammalian 

20 TREX or a purified human TREX. The sera are extracted from 
the host animal and are screened to obtain polyclonal 
antibodies which are specific to the immunogen. Methods of 
screening for polyclonal antibodies are well known to those 
of ordinary skill in the art such as those disclosed in 

25 Harlow & Lane, Antibodies: A Laboratory Manual . (Cold Spring 
Harbor Laboratories, Cold Spring Harbor, NY: 1988) the 
contents of which are hereby incorporated by reference. 

The monoclonal antibodies may be produced by immunizing for 
30 example, mice with an immunogen. The mice are inoculated 
intraperitoneal ly with an immunogenic amount of the above- 
described immunogen and then boosted with similar amounts of 
the immunogen. Spleens are collected from the immunized mice 
a few days after the final boost and a cell suspension is 
35 , prepared from the spleens for use in the fusion. 

Hybridomas may be prepared from the splenocytes and a murine 
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tutnor partner using the general somatic cell hybridization 
technique of Ktohler, B. and Milstein, C, Nature (1975) 256: 
495-497. Available murine m/eloma lines, such as those from 
the Americcui Type Culture Collection (ATCC) 12301 Parklawn 
5 Drive, Rockville, MD 20852 USA, may be used in the 
hybridization. Basically, the technique involves fusing the 
tumor cells and splenocytes using a fusogen such as 
polyethylene glycol. After the fusion the cells are 
separated from the fusion medium and grown in a selective 

10 growth medium, such as HAT meditmi, to eliminate unhybridized 
parent cells. The hybridomas may be expanded, if desired, 
and supernatant s may be assayed by conventional immunoassay 
procedures, for example radioimmunoassay, using the 
immunizing agent as antigen. Positive clones may be 

15 characterized further to determine whether they meet the 
criteria of the invention antibodies. 

Hybridomas that produce such antibodies may be grown in 
vitro or in vivo using known procedures- The monoclonal 
20 antibodies may be isolated from the culture media or body 
fluids, as the case may be, by conventional immunoglobulin 
purification procedures such as ammonium sulfate 
precipitation, gel electrophoresis, dialysis, 

chromatography, and ultrafiltration, if desired. 

25 

In the practice of the subject invention any of the above- 
described antibodies may be labeled with a detectable 
marker. In one embodiment, the labeled antibody is a 
pxirified labeled antibody. The term "antibody" includes, by 

30 way of example, both naturally occurring and non-naturally 
occurring antibodies. Specifically, the term "antibody" 
includes polyclonal and monoclonal antibodies, and fragments 
thereof. Furthermore, the term "antibody" includes chimeric 
antibodies and wholly synthetic antibodies, and fragments 

35 thereof. A "detectable moiety" which functions as detectcQjle 
labels are well known to those of ordinaiy skill in the art 
and include, but are not limited to, a fluorescent label, a 
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radioactive atom, a paramagnetic ion, biotin, a 
chemiluminescent label or a label which may be detected 
through a secondasry enzymatic or binding step. The 
secondary enzymatic or binding step may con5)rise the use of 
5 digoxigenin, alkaline phosphatase, horseradish peroxidase, 
S-galactosidase, fluorescein or steptavidin/biotin. Methods 
of labeling antibodies are well known in the art. 

This invention provides a method of inhibiting TREX protein 
10 interaction with a TRAP protein comprising administering a 
ligand comprising an amino acid domain which binds to a EXT 
C domain of the TREX protein so as to inhibit binding of the 
TREX protein to the TRAP protein. In an embodiment the TREX 
protein is a mammalian protein. In a prefeanred embodiment, 
15 the TREX protein is a hvmian protein. 

Inhibition of the TREX protein interaction with a URAP 
protein may prevent TRAP induced NF-kB activation. 
Accordingly the above -described method may be used to 
20 control cell differentiation, cell proliferation, and 
apoptosis (programmed cell death) . Accordingly, this method 
would be used to treat diseases such as cancer, autoimmune 
diseases and inflammation by inhibiting tumor cell growth 
and differentiation. 

25 

As used herein ligands conprising an amino acid domain which 
binds to a TREX protein, which binds to a TRAP binding 
domain, or which block TRAP binding are defined as cui amino 
acid molecule or fragment thereof which has an amino acid 
30 sequence complementary to a TREX protein. 

This invention provides a method of inhibiting 
overexpression of TREX protein comprising administering any 
of the above-described antisense oligonucleotides which bind 
35 to an mRNA molecule encoding a human Tumor necrosis factor 
Receptor-Associated Pactor (TRAF)„ protein- interacting 
hereditary multiple extoses (TREX) protein so as to inhibit 
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overeaqpression of the human TREX protein . 

In an embodiment of the above -described method inhibiting 
overe3<pression of TREX protein thereby inhibits TRAF-induced 
CD40 signal dependent NF-kB activation. Accordingly the 
above -described method may be used to control cell 
differentiation, cell proliferation, and apoptosis 
(programmed cell death) . Accordingly, this method would be 
used to treat diseases such as cancer, autoimmune diseases 
and inflairanation by inhibiting tumor cell growth and 
differentiation. 

In another embodiment of the above-described method the 
ligand is an antibody capable of binding to the TREX 
protein. In a further embodiment of the above -described 
method the antibody is a monoclonal or a polyclonal 
antibody. 

This invention provides a method of inhibiting growth of a 
tumor cell comprising blocking a TRT^ interacting site of a 
TREX protein by administering a ligand capable of binding to 
the TRAP interacting site of a TREX protein. 

In an embodiment of the above-described method, the TRAP 
interacting site is a hereditary multiple extoses C (EXT C) 
domain. In coiother embodiment the tumor cell growth is 
inhibited in vivo or in vitro . In a further embodiment the 
ligand is an antibody capable of binding to the TRAP 
interacting site of a TREX protein. In still further 
embodiments the antibody is a monoclonal or a polyclonal 
antibody. 

This invention provides a pharmaceutical composition 
comprising an amoimt of any of the above -described 
oligonucleotides effective to prevent overexpression of a 
TREX protein and a pharmaceutically acceptable carrier 
capable of passing through a cell membrane. 
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This invention provides a pharmaceutical composition 
comprising an amount of any of the above-described 
antibodies effective to block binding of a TREX protein to 
a TRT^ protein and a pharmaceutically acceptable carrier 
5 capable of passing through a cell membrane. 

This invention provides a method of administering the above- 
described pharmaceutical compositions comprising an amount 
of any of the above-described ligands, oligonucleotides or 

10 antibodies which are determined to be potentially 
therapeutic, wherein the administration is intravenous, 
intraperitoneal , intrathecal , intralymphatical , 

intramuscular, intralesional, parenteral, epidural, 
subcutaneous; by infusion, liposome -mediated delivery, 

15 aerosol delivery; topical, oral, nasal, anal, ocular or otic 
delivery. 

The present invention also provides a pharmaceutical 
composition comprising a effective amount of any of the 

20 above -described ligands, oligonucleotides or antibodies 
which are determined to be potentially therapeutic and a 
pharmaceutically acceptable carrier. In the s\ibject 
invention an "effective amount" is any amount of the above- 
described ligands, oligonucleotides or antibodies which are 

25 determined to be potentially therapeutic, which, when 
administered to a subject suffering from a disease or 
abnormality against which the above-described ligands, 
oligonucleotides or antibodies which are determined to be 
potentially therapeutic, are effective, causes reduction, 

30 remission, or regression of the disease or abnormality. In 
the practice of this invention the "pharmaceutically 
acceptable carrier" is any physiological carrier known to 
those of ordinary skill in the art useful in formulating 
pharmaceutical compositions. 

35 

In one preferred embodiment the pharmaceutical carrier may 
be a liquid and the pharmaceutical composition would be in 
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the form of a solution. In another equally preferred 
embodiment, the pharmaceutically acceptable carrier is a 
solid and the composition is in the foxm of a powder or 
tablet. In a ftirther embodiment, the pharmaceutical carrier 
is a gel and the composition is in the form of a suppository 
or cream. In a further embodiment the compound may be 
formulated as a part of a pharmaceutically acceptable 
transdermal patch. 

A solid carrier can include one or more substances which may 
also act as flavoring agents, lubricsuits, solubilizers, 
suspending agents, fillers, glidants, compression aids, 
binders or tablet -disintegrating agents; it can also be an 
encapsulating material. In powders, the carrier is a finely 
divided solid which is in admixture with the finely divided 
active ingredient. In tablets, the active ingredient is 
mixed with a carrier haying the necessary compression 
properties in suitable proportions and conpacted in the 
shape and size desired. The powders and tablets preferadDly 
contain up to 99% of the active ingredient. Suitable solid 
carriers include, for example, calcium phosphate, magnesium 
stearate, talc, sugars, lactose, dextrin, starch, gelatin, 
cellulose, polyvinylpyrrolidine, low melting waxes and ion 
exchange resins. 

Liquid carriers are used in preparing solutions, 
suspensions, emulsions, syrups, elixirs and pressurized 
compositions. The active ingredient can be dissolved or 
suspended in a pharmaceutically acceptable liquid carrier 
such as water, an organic solvent, a mixture of both or 
pharmaceutically acceptable oils or fats. The liquid 
carrier can contain other suitable pharmaceutical additives 
such as sol\ibilizers, emulsifiers, buffers, preservatives, 
sweeteners, flavoring agents, suspending agents, thickening 
agents, colors, viscosity regulators, stabilizers or osmo- 
regulators. Suitable examples of liquid carriers for oral 
and parenteral administration include water (partially 
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contalziing additives as above, e.g. cellulose derivatives, 
preferably sodium carboxymethyl cellulose solution) , 
alcohols (including monohydric alcohols and polyhydric 
alcohols, e.g. glycols) and their derivatives, and oils 
5 (e.g. fractionated coconut oil and arachis oil) . For 
parenteral administration, the carrier can also be an oily 
ester such as ethyl oleate and isopropyl tnyristate. Sterile 
liquid carriers are useful in sterile liquid form 
compositions for parenteral administration. The liquid 
10 carrier for pressurized compositions can be halogenated 
hydrocarbon or other pharmaceutically acceptable propellent. 

Liquid pharmaceutical compositions which are sterile 
solutions or suspensions can be utilized by for example, 

15 intramuscular, intrathecal, epidural, intraperitoneal or 
subcutaneous injection. Sterile solutions can also be 
administered intravenously. The compoimds may be prepared 
as a sterile solid composition which may be dissolved or 
suspended at the time of administration using sterile water, 

20 saline, or other appropriate sterile injectable medium. 
Carriers are intended to include necessaary- and inert 
binders, suspending agents, lubricants, flavorcuits, 
sweeteners, preservatives, dyes, and coatings. 

t- 

25 The above-described ligands, oligonucleotides or antibodies 
which are determined to be potentially therapeutic can be 
administered orally in the foinn of a sterile solution or 
suspension containing other solutes or suspending agents, 
for example, enough saline or glucose to make the solution 

30 isotonic, bile salts, acacia, gelatin, sorbitcoi monoleate, 
polysorbate 80 (oleate esters of sorbitol and its anhydrides 
copolymerized with ethylene oxide) and the like. 

The above-described ligands, oligonucleotides or antibodies 
35 which are determined to be potentially therapeutic can also 
be administered orally either in liquid or solid conposition 
form. Compositions suitable for oral administration include 
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solid forms I such as pills, capsxiles, granules, tablets, and 
powders, and liquid forms, such as solutions, syrups, 
elixirs, and suspensions. Forms useful for parenteral 
administration include sterile solutions, emulsions, and 
5 suspensions . 

Optimal dosages to be administered may be determined by 
those skilled in the art, and will vary with the particular 
ligands, oligonucleotides or antibodies in use, the strength 

10 of the preparation, the mode of administration, and the 
advancement of the disease condition or abnormality. 
Additional factors depending on the particular subject being 
treated will result in a need to adjust dosages, including 
subject age, weight, gender, diet, and time of 

15 administration. 

This invention provides a method of treating an abnormality 
in a subject, wherein the abnormality is alleviated by the 
inhibition of binding of a TREX protein and a TRAP protein 

20 which comprises administering to the subject an effective 
amount of the above described pharmaceutical composition 
effective to block binding of the TREX protein and the TRAP 
protein in the subject, thereby treating the abnormality in 
the subject. In an embodiment the TRAP protein is TRAF2, 

25 TRAF3 or TRAP 5. In a preferred embodiment the abnormality 
is Ccuicer, a hereditary multiple extosis or an autoimmune 
disease. In a further preferred embodiment the cancer is 
colon cancer, gastric cancer, human squamous cell carcinoma, 
prostate carcinoma, breast cancer, or papillary bladder 

30 cancer. 

This invention provides a method of treating an abnormality 
in a subject, wherein the abnormality is alleviated by the 
inhibition of overexpression of a TREX protein which 
35 comprises administering to the subject cui effective amount 
of the above -described pharmaceutical composition effective 
to inhibit overexpression of the TREX protein, thereby 
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treating the abnormality in the subject. In a preferred 
embodiment the abnormality is cancer, a hereditary multiple 
extosis or an autoimmune disease. In a further preferred 
embodiment the. cancer is colon cancer, gastric cancer, human 
5 head and neck squamous cell carcinoma, prostate carcinoma, 
breast cancer, thyroid cancer, esophageal cancer, lung 
cancer, colorectal cancer, ovarian cancer, papillary bladder 
cancer, osteosarcoma, chondrosarcoma, liposarcoma, giauit 
cell tumor, Ewing sarcoma, and other malignant tumors. 

10 

This invention provides a method of screening for a chemical 
compound which inhibits TREX protein and TRAF protein 
binding comprising: (a) incubating the chemical compound 
with a TREX protein and a TRAP protein; (b) contacting the 

15 incubate of step (a) with an affinity medium under 
conditions so as to bind a TREX protein-TRAF protein 
conqplex, if such a complex forms; and (c) measuring the 
amount of the TREX protein-TRAF protein complex formed in 
step (b) so as to determine whether the compound is capable 

20 of interfering with the formation of the complex between the 
TREX protein-TRAF protein. 

Additional methods for cui assay to screen for drugs which 
inhibit the TREX-TRAF binding which are known to one of 
25 ordinary skill in the art include but are not limited to the 
two-hybrid screening system using yeast and mammalicin cells 
(Fields, S. and O. Song, Nature, 340:245-246, 1989, the 
contents of which are hereby incorporated by reference) . 

30 In the above -described methods of screening for a chemical 
compound . which inhibits TREX protein and TRAP protein 
binding association conditions, including but not limited to 
low salt, pH, or temperature may be used to compare the 
amount of TREX-TRAF complex formed without incubation with 

35 the compound. 

In an eihbodiment the TEAF protein is TRAF2, TRAF3 or TRAF 5. 
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In a preferred embodiment the compound may be a CD40 
receptor ligand or a CD40 antibody. 

In a preferred embodiment of the above-described methods, 
5 the molecule is a peptide or a fragment thereof which 
comprises a TRAP binding domain. In further embodiments the 
TRAP protein is TRAP2, TRAF3 or TRAP 5. 

This invention provides a method of preventing inhibition of 
10 activation of a CD40 signal -dependent NF-kB activation 
comprising administering any of the above -described 
antisense oligonucleotides which bind to an mRNA molecule 
encoding a human Tumor necrosis factor Receptor-Associated 
Factor (TRAP) protein- interacting hereditary multiple 
15 extoses (TREX) protein so as to prevent inhibition of 
activation of CD40 signal -dependent NP-kB activation. 

This invention provides a method of preventing inhibition of 
activation of a CD40 signal -dependent NF-kB activation 
20 comprising administering a ligand comprising an amino acid 
domain which binds to a EXT C domain of the TREX protein so 
as to inhibit binding of the TREX protein to the TRAP 
protein, thereby preventing inhibition of activation of a 
CD40 signal -dependent NP-kB activation. 

25 

In a preferred embodiment of the above -described method the 
ligand is peptide or a fragment thereof which comprises a 
TRAP binding domain. 

30 This invention provides a method of detecting a 
predisposition to cancer which comprises detecting of a 
genetic alteration in a nucleic acid encoding TREX protein 
in the sample from the subject. In a preferred embodiment of 
the above -described method the mutation is a silent point 

35 mutation or a missense point mutation. In another preferred 
embodiment of the above -described method the genetically 
altered nucleic acid encoding TREX protein is detected by 
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contacting the nucleic acid from the sample with a TREX 
nucleic acid probe under conditions permitting the TREX 
nucleic acid probe to hybridize with the nucleic acid from 
the sanple, thereby detecting the genetic alteration in the 
5 nucleic acid encoding TREX protein in the sample. 

Methods of detecting genetic alterations in nucleic acid 
irolecules are well known to one of ordinary skill in the art 
and include but are not limited to methods such as single 
10 strand conformation polymorphism detection, RNase protection 
assay, and PGR direct sequencing. As used herein, genetic 
alterations in nucleic acid molecules which may be detected 
include point mutations, deletions, translocations, and 
insertions . 

15 

In other preferred embodiments the cancer is colon cancer, 
gastric cancer, human head and neck squamous cell carcinoma, 
prostate carcinoma, breast cancer, thyroid cancer, 
esophageal cancer, lung cancer, colorectal cancer, pvaricui 

20 cancer, papillary bladder cancer, osteosarcoma, 
chondrosarcoma, liposarcoma, giant cell tumor, Ewing 
sarcoma, and other malignamt tumors. In another preferred 
embodiment of the above -described method the TREX nucleic 
acid probe cotrprises a nucleic acid molecule of at least 15 

25 nucleotides which specifically hybridizes with a unique 
sequence included within the sequence of an isolated nucleic 
acid molecule encoding a Tumor necrosis factor Receptor- 
Associated Factor (TRAP) protein- interacting hereditary 
multiple extoses (TREX) protein. In an embodiment of the 

30 TREX nucleic acid probe the nucleic acid is DNA. In another 
embodiment of the TREX nucleic acid probe the nucleic acid 
is RIIA. 

This invention provides a TREX nucleic acid probe comprising 
35 a sequence capable of specifically hybridizing with a unique 
sequence included within the above -described isolated DNA 
molecule encoding a Tumor necrosis factor Receptor- 
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Associated Factor (TRAP) protein- interacting hereditaxry 
multiple extoses (TREX) protein. In an embodiment the 
nucleic acid probe comprises a nucleic acid molecule of at 
least 15 contiguous nucleotides capable of specifically 
5 hybridizing with a xinique sec[uence included within the 
sequence of the isolated nucleic acid molecule encoding a 
Tumor necrosis factor Receptor-Associated Factor (TRAF) 
protein- interacting hereditary multiple extoses (TREX) 
protein- In a further embodiment the TREX is mammalian 
10 protein. In an embodiment the mammalian TREX protein is 
mouse protein* In a preferred embodiment the mammalian TREX 
protein is human protein. 

This invention provides a TREX nucleic acid probe comprising 
15 a sequence capable of specifically hybridizing with a unique 
sequence included within the above -described isolated mRNA 
molecule encoding a Tumor necrosis factor Receptor- 
Associated Factor (TRAF) protein- interacting hereditary 
multiple extoses (TREX) protein. In an embodiment the 
20 nucleic acid probe comprises a nucleic acid molecule of at 
least 15 contiguous nucleotides capable of specifically 
hybridizing with a luiique sequence included within the 
sequence of the isolated nucleic acid molecule encoding a 
Tumor necrosis factor Receptor-Associated Factor (TRAF) 
25 protein- interacting hereditary multiple extoses (TREX) 
protein. In a further embodiment the TREX is mammalian 
protein. In an eihbodiment the mammalian TREX protein is 
mouse protein. In a preferred embodiment the mammalian TREX 
protein is human protein - 

30 

This invention provides a TREX nucleic acid probe comprising 
a sequence capable of specifically hybridizing with a unique 
sequence included within the above -described isolated 
genomic DNA molecule encoding a Tumor necrosis factor 
35 Receptor-Associated Factor (TRAF) protein- interacting 
hereditary multiple extoses (TREX) protein. In an 
embodiment of the method the mutation comprises a portion of 
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a tumor suppressor locus. In an embodiment the nucleic acid 
probe comprises a nucleic acid molecule of at least 15 
contiguous nucleotides capable of specifically hybridizing 
with a unique sequence included within the sequence of the 
5 isolated nucleic acid molecule encoding a Tumor necrosis 
factor Receptor-Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein. In a further 
embodiment the TREX is mammalian protein. In an embodiment 
the mammalian TREX protein is mouse protein. In a preferred 
10 embodiment the mammalian TREX protein is human protein. 

This invention provides a method of diagnosing cancer in a 
subject which comprises: a) obtaining DMA from the sample of 
a subject suffering from cancer; b) performing a restriction 

15 digest of the DNA with a panel of restriction enzymes; c) 
separating the resulting DNA fragments by size 
fractionation; d) contacting the resulting DNA fragments 
with a nucleic acid probe capable of specifically 
hybridizing with a unique sequence included within the 

20 sequence of a genetically altered nucleic acid molecule 
encoding a TREX protein, wherein the nucleic acid is labeled 
with a detectable marker; e) detecting labeled bands which 
have hybridized to the nucleic acid probe in step (d) , 
wherein the sequence of a genetically altered nucleic acid 

25 molecule encoding a TREX protein creates a unique band 
pattern specific to the DNA of subjects suffering from 
cancer; f) preparing DNA obtained from a sample of a subject 
for diagnosis by steps (a-e) ; and g) corrparing the detected 
band pattern specific to the DNA obtained from a sample of 

30 subjects suffering from cancer from step (e) and the DNA 
obtained from a sample of the siibject for diagnosis from 
step (f) to determine whether the patterns are the same or 
different and to diagnose thereby predisposition to cancer 
if the patterns are the same. 

35 

As used herein, genetic alterations in nucleic acid 
molecules which may be detected include point mutations, 
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deletions, translocations, and insertions. 

In an embodiment of the above -described method the size 
fractionation in step (c) is effected by a polyacrylamide or 
agarose gel- In another embodiment of the method the 
detectable marker is radioactive isotope, enzyme, dye,, 
biotin, a fluorescent label or a chemi luminescent label. In 
a preferred embodiment of the above -described method, cancer 
associated with the expression of a mutated TREX protein is 
diagnosed. In further preferred embodiments of the above- 
described method the cancer is colon cancer, gastric cancer, 
humcin head and neck squamous cell carcinoma, prostate 
carcinoma, breast cancer, thyroid cancer, esophageal cancer, 
lung cancer, colorectal cancer, ovarian cancer, papillary 
bladder cancer, osteosarcoma, chondrosarcoma, liposarcoma, 
giant cell tumor, Bwing sarcoma, and other malignant tumors. 

This invention provides a method of diagnosing cancer in a 
subject which con?>rises: a) obtaining RNA. from the sample of 
the subject suffering from cancer; b) separating the RNA 
sample by size fractionation; c) contacting the resulting 
RNA species with a nucleic acid probe capeible of 
specifically hybridizing with a unique sequence included 
within the sequence of a nucleic acid molecule encoding a 
mutated TREX protein, wherein the sequence of the nucleic 
acid molecule encoding the mutated TREX protein is labeled 
with a detectable marker; d) detecting labeled bands which 
have hybridized to the RNA species to create a unique band 
pattern specific to the RNA of subjects suffering from 
cancer; e) preparing RNA obtained from a sample of a sxabject 
for diagnosis by steps (a-d) ; and f ) comparing the detected 
band pattern specific to the RNA obtained from a sample of 
subjects suffering from cancer from step (d) and the RNA 
obtained from a sample of the subject for diagnosis from 
step (f ) to determine whether the patterns are the same or 
different and to diagnose thereby predisposition to cancer 
if the patterns are the same. In an embodiment of the 
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method the size fractionation in step (c) is effected by a 
polyacrylamide or agarose gel. In another embodiment of the 
method the detectable marker is radioactive isotope, enzyme, 
dye, biotin, a fluorescent label or a chemilximinescent 
5 label. In a preferred embodiment of the above-described 
method, cancer associated with the expression of a mutated 
TREX protein is diagnosed. In further preferred embodiments 
of the above -described method the cancer is colon cancer, 
gastric cancer, human squamous cell carcinoma, prostate 
10 carcinoma, breast cancer, or papillary bladder cancer. 

This invention will be better understood from the 
Experimental Details which follow. However, one skilled in 
the art will readily appreciate that the specific methods 
15 and results discussed are merely illustrative of the 
invention as described more fully in the claims which follow 
thereafter . 

FIRST SBRXES OF EXPERIMENTS 

20 

Tumor necrosis factor (TNF) receptor-associated factor 
(TRAF) proteins contribute to signal transduction induced by 
TNF receptor family signaling. TRAF3 cloned as binding 
protein to the cytoplastic domain of CD40, a member of TNF 

25 receptor superfamily, is believed to be involved in 
signaling pathway induced by CD40, Lymphotoxin (LT) fi 
receptor, CD30 ligation (1-7) . Here molecular cloning of a 
novel TRAF- interacting protein named as TREX because of 
TOAF- interacting EXT (hereditary multiple exostoses) gene 

30 family protein is reported. TREX has a highly homologous 
sequence to the EXT gene family, a candidate of tumor 
suppressor gene (20-22) . TREX strongly interacts with TRAF2 
and TRAF3, and TREX and TRAF protein colocalize in mammalian 
cells. Moreover, overexpression of TREX inhibited NF-kB 

35 activity induced by TRAF-mediated signaling. These findings 
indicate that TREX and the other EXT gene family proteins 
can function as a mediator in receptor signaling and could 
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be involved in tuniorigenesis . 

CTPTOTMR HTAL DETAILS 
METHODS AND MATERIALS 

5 

Two -hybrid screening 

Two -hybrid screening was performed in yeast L40 (MATa) 
strain cells with plasmid pBTM116 containing human TRAF3 

10 (amino acids 82-543) subcloned in frame with the LexA as a 
bait and a mouse embryo cDNA library cloned into pVP16 as 
described previously (36) . In order to obtain the clones 
containing cDNA encoding protein which binds specifically to 
TRAF3, clones that formed on histidine-def icient media and 

15 produced a blue reaction product with X-gal in filter assays 
(37) were cured of the LexA-TRTVFS plasmid by growing cells 
in tryptophan- containing medium, and then mated against a 
panel of yeast strains NA87-11A (MATa) containing plasmid 
pBTMllS that produce LexA fusion protein with lamin. Fas and 

20 CD40. Mated cells were selected for growth in medium lacking 
tryptophan and leucine, and subsequently tested for the 
ability to trans-activate a lacZ reporter gene by growing 
cells on histidine-deficient media and a &-Gal colometric 
assay (37) . 

25 

Northern blot analysis 

Human and mouse Multiple Tissue Northern Blots (Clontech) 
were probed with human and mouse TREX cDNA, respectively. 

30 

Plasmid construction and transfection 

Full length coding regions of TRAFs, TREX and their mutants 
were atiplified by PGR and subcloned into FLAG-tagged pCR3.1 
35 or myc -tagged pcDNA3.1 (Invitrogen) . Mouse CD40 and CD40L 
were also amplified by PGR and subcloned into pMIKHygB. 293 
cells and 293 T cells were trctnsfected by standard calcium 
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phosphate coprecipitation method. COS cells were transfected 
by use of FuGENE 6 (Boehringer Mannheim) . 

Production of antl-TREX, imminoprecipltatlon and western 
blot analysis 

Rabbit anti-TREX polyclonal antibody raised against a 
recombinant Glutathion S- transferase- fused mouse TREX 
protein. 293T cells (2 x 10* cells) were transfected with the 
indicated plasmids. After transf action (40 hours) , cell 
lysates were prepared in Lysis buffer (20 mM Tris (pH 7.6), 
150 TiM NaCl, 1 % Triton X-100, 1 mM EDTA (pH 8.0), 10 /ig/ml 
of aprotinin, 10 /xg/ml of leupeptin, 5 mM Benzamidine and 1 
ixM PMSF) and incubated with indicated cUitibodies and 25//1 of 
50% slurry of protein G-Sepharose. Immunoprecipitates were 
detected by Western blot analysis using the indicated 
antibody. To detect endogenous TREX protein, cell lysates of 
human colon carcinoma cell line KM12L4 were 
immunoprecipitated with anti-TREX antibody and detected by 
Western blot analysis using anti-TREX antibody. 

Innmmohlstocheiiiistry 

COS7 cells were transfected with TRAF3 or myc-tagged TREX. 
After transf ection (40 hours) , cells were fixed with 
methanol. For detection of TREX protein, Anti-myc antibody 
(9E10, BIOMOL) and Phycoerythrin- ant i -mouse IgG (Chemicon) 
were used for 1st and 2nd antibody, respectively. For 
detection of TRAF protein, anti-TRAF3 antibody (Santa Cruz) 
and FITC-anti-rabbit IgG (Santa Cruz) were used for 1st and 
2nd antibody, respectively. 

Reporter gene assay 

293 cells (1 X 10* cells) were transfected with 
NF-KB-dependent reporter gene (pkBtkLuc) , the indicated 
plasmids and pRL-CMV (Promega) for normalization of 
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transfection efficiency as described previously (2) . After 
transfection (40 hours) , the cell lysates were prepared and 
luciferase activity measured using Dual -lucif erase reporter 
assay system (Promega) . 

BXPERIMF MT"- P ^SULTS AMD DISCUSSIOW 

TNF receptor- associated factor (TRAP) protein family members 
have been reported to contribute to TNF receptor- initiated 
signaling through direct binding to the cytoplasmic region 
of receptors, resulting in the activation of many signaling 
molecules such as transcription factor NF-kB, mitogen- 
activated protein kinase (MAPK) , although TRAPl and TRAP4 
have not been implicated clearly (2, 8-13). Overexpression 
of TRAF2 activates NF-kB and JNK/SAPK via NF-kB- inducing 
kinase (NIK) -dependent pathway and -independent pathway, 
respectively (14-16) . TRAF5 activates NF-kB and TRAF6 
activates NF-kB and ERK/MAPK pathway (2, 9-12) . Although 
TRAF2 is implicated to be required for protection against 
TNF- induced apoptosis via NF-kB- independent pathway (17, 
IB) , TRAF5 or TRAF6 could act to activate NF-kB pathway in 
place of TR7^2. These observations suggest that action of 
TRAP proteins seem to be regulated properly in response to 
each receptor signaling for the expression of receptor 
functions. On the other hand, overexpression of TRAF3 has 
been demonstrated to suppress the activation of NF-kB and 
ERK/MAPK induced by CD40 crosslinking (2, 8) . TRAF3 is 
implicated to be required for postnatal development and 
T-dependent immune responses (19), but no plausible 
signaling pathways or molecules via TRAF3 which lead to 
explain these biological functions were reported so far, in 
turn, the specificity and function of TRAF3 -mediated 
signaling are still unclear. 

Analyzing the signaling molecules downstream of TRAF3 would 
provide an understanding of the function of TRAF3 and its 
specificity. To identify the signaling molecules which 
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Specifically bind to TRAP3, two-hybrid screening of a mouse 
embryo cDNA library was performed using TRAF3 (amino acids 

82-543) as a bait. In this screening, multiple cDNA clones 
encoding several kinds of proteins were identified by 
5 seG[uencing. One clone among these positive clones, showed a 
putative isoleucine zipper motif in its sequence (Pig. la) . 
On the basis of a partial sequence, marathon PGR 
anplif ication and 5'-RACE methods were carried out, and a 
mouse full length sequence with an open reading frame of 

10 2,757 bp, which encodes a 918 amino acid peptide, was 
obtained (Fig- la) . Human full length cDNA with an open 
reading frame of 2,760 bp, which encodes a 919 amino acid 
peptide with 96.8 % identity to the mouse sequence, was also 
identified by screening of a human fetal brain cDNA library 

15 and the 5'-RACB method (Fig. la) . A BLAST data base search 
revealed that the C-terminal region of these clones shows 
significant homology to hereditary multiple exostoses (EXT) 
gene family proteins such as EXTl , EXT2 , EXTLl . EXTL2 and C. 
elegans rib- 2 (Fig. lb) (20-25) . Therefore, this new gene 

20 was designated as TREX (for TOAF- interacting EXT gene family 
protein) . Based on homology searches among EXT family 
proteins including TREX, permitted designating the highly 
homologous C-terminal regions as EXT domains, which are 
divided into two domains, EXT-N and EXT- C domains (Fig. Ic, 

25 d) . These new conserved regions might function as signaling 
mediators by protein-protein interaction. Surprisingly, 
human and mouse TREX have significant homology to C. elegans 
rib-2 (Fig. 1 c, d) in not only the EXT domain but the 
region between the EXT-N and the EXT-C domains (33 %, data 

30 not shown) . This obsearvation suggests that TREX protein 
plays a critical role in development beyond species. 

Next the expression of TREX mRNA and protein was examined. 
Northern blot analysis revealed about 7.0 kilobases 
35 transcript of TREX expressed in various tissues, with high 
expression in brain, heart, skeletal muscle (Fig. le) . To 
examine the endogenous TREX protein in mammalian cells, cell 
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lysates of human colon carcinoma cell line KM12L4 were 
immunoprecipitated with either a normal rabbit IgG or a 
rabbit anti-TREX antibody. Anti-TREX antibody detected a 
specific band at about 107 kDa, which is consistent with the 
5 predicted molecular weight of full length TREX (Pig. If) - 

As TREX has cloned as TRAF3 -binding protein, the binding 
specificity to TRAP family proteins was examined. The 293T 
cells were transfected with TREX and TRAP expression 

10 plasmids. Coimmunoprecipitation experiments indicated that 
not only TRAF3 but also TRAF2 strongly and TRAF5 weakly 
binds to TREX (Pig. 2a) . This observation leads to the 
consideration that TRAP proteins interact with TREX through 
TRAP domain, which is comparatively conserved among TRAP 

15 proteins, and that TREX cuid TRAP protein should colocalize 
in the cells. To examine the localization of TREX protein 
and TRAF3 protein, C0S7 cells were transfected with TREX or 
TRAP3 expression plasmids. TRAPS protein localized in 
cytoplasm, especially the region outside of the nuclear 

20 membrane, and TREX also localized around the nuclear 
membrane (Pig. 2b) . These results suggest that TREX and 
TRAP proteins are physically associated in mammalian cells. 

The interaction of TREX and TRAF2 or TRAPS indicated that 
25 TREX could be involved in TRAP-mediated signaling. 
Therefore, whether the expression of TREX protein could 
affect NP-kB activation induced by several stimulation was 
tested. 29S cells were transfected with TREX with CD40 and 
CD40 ligamd in the presence of a NP-KB-specif ic reporter 
30 gene. As shown in Fig. 3, CD40 signal -dependent NF-kB 
activation was inhibited by overexpression of TREX in a dose 
dependent manner, indicating that TREX could contribute to 
NF-kB pathway induced by CD40 ligation. Next, applicant 
examined whether TREX is involved in NF-kB activation 
35 mediated TRAF2 or not. 

Overexpression of TREX upregulated TRAP2- induced NP-kB 
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activation (Pig. 4) . These results suggest that TREX acts as 
a negative regulator of NP-kB pathway by direct interaction 
with TRAF2 in TNF receptor type II signaling. 
TRT^- interacting proteins TANK/I -TRAP and TRIP proteins, 
5 which inhibit NP-kB activity induced by TNF receptor family 
stimulation, were cloned by two-hybrid screening (26-28) . 
TRIP protein was proposed to be regulated by switching with 
antiapoptotic protein such as c-IAP in response to the 
signals leading to cell activation or cell death (26) . 
10 However, as the biological function of these proteins in 
TRAF-mediated signaling is still imknown, it is important to 
further analyze the activity of several signaling molecules. 

Demonstrated here is the identification of a novel 
15 TRT^- interacting protein, TREX, and the contribution of TRBX 
protein in CD40/TNP receptor type II signaling mediated by 
TRAP family. Furthermore, the sequence of this new protein 
TREX revealed a high homology to the EXT gene family and 
novel domains named EXT-N and EXT-C domains. This conserved 
20 sequence in the EXT domain suggests that the EXT domain 
might contribute to protein-protein interaction. Whether the 
EXT domain of the other EXT gene family proteins is involved 
in protein-protein interaction or not is now under 
investigation . 

25 

EXT gene family proteins, EXTl and EXT2 have been cloned by 
positional cloning on the basis of linkage analysis in 
inf03rmative exostoses families (20-22) . Some mutation was 
found in these genes, suggesting these genes should be 

30 candidate genes responsible for EXT (20-22, 29-31) . Three 
loci have been localized. The EXTl and EXT2 were localized 
on chromosome 8q24.1, llpll-13, respectively (20, 32, 33), 
and the third gene EXT3 on 19p was not identified (34) . Also 
identified was the chromosomal localization of human TREX on 

35 chromosome 8pll-12 (Shao et al., submitted), excluding TREX 
as a Ccuididate gene for EXT3. It is inportant to investigate 
whether TREX could be responsive to EXT or BXT-related 
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diseases. EXT family protein has been suggested to be a 
tumor suppressor gene because previous reports showed that 
multiple mutation in chondrosarcoma from sporadic tumors and 
tumors derived from malignant degeneration of exostoses (31, 
5 35) . Also identified was some infrequent mutation in TREX 
gene in some tumors (Shao et al., submitted), suggesting 
TREX might contribute to prevention of abnormal development 
such as transformation and tumorigenesis . The mutation of 
TREX gene in many kinds of tumor sanples is being surveyed. 

10 

Not only mammals but also species such as C.elegcucis which 
lack bone in their body have homologous genes to the EXT 
gene family according to EST database search (25) , 
suggesting that the EXT family proteins play an important 
15 role in development except bone development. A TREX-knockout 
mouse and rib- 2 -knockout C. elegans cure being made . Knockout 
of EXT gene family genes in these species will facilitate an 
understanding of their function and their importance during 
development . 

20 

Five EXT gene family proteins were identified but the 
fiinction of these gene products has been unknown. In this 
study, it is shown for the first time that an EXT family 
protein, TREX, acts as a signaling molecule mediating TNF 

25 receptor superfamily (Figs. 3,4) . Also shown is that the 
BXT-domain of TREX interacts with TRAF proteins^ which 
mediate receptor signaling through direct binding. These 
findings imply that the other EXT proteins could act as 
signaling mediators in receptor signaling. As TREX and the 

30 other EXT family proteins are easily thought to be involved 
in receptor signaling, the development of inhibitor (s) of 
signaling cascades related to TREX or the other EXT family 
pTOteins will be used to design drugs to treat many diseases 
including cancer. 
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Second Series of ExDer ^™*>tn^^o 

Hereditary ntultiple exostoses (EXT) is an autosomal dominant 
disorder characterized by short stature and the development 
5 of multiple bone tumour (1-3) . Three genetic loci have been 
identified by genetic linkage analysis at chromosome 8q24.1 
(EXT1)(4), llpll-13 (EXT2) (5) and 19p (EXT3) (6). The 
putative tumour suppressor gene EXTl and EXT2 were 
identified and characterized {7,8), Recently^ two EXT-like 

10 genes, EXTLl (9) and EXTL2 (10) have also been identified, 
BXTLl and EXTL2 were mapped to chromosome lp36.1 and 
lpll-12, respectively, a region that frequently deleted in 
various tumour types. Previously reported was the isolation 
of a novel member of EXT gene family, designated TREX from 

15 mouse (11) . Reported here is the isolation of TREX from 
human and located it at chromosome 8pll-12 by fluorescence 
in situ hybridization, a region that also frequently deleted 
in various tumours. In preliminary screens, TREX 
alterations were observed in some humeui cancers. This gene, 

20 TREX, therefore, may be a novel member of EXT gene family 
cuid may be a potential candidate which appears to be 
associated with the oncogenesis of multiple human genes. 

Hereditary multiple exostoses (EXT) is an inherited multiple 
25 disorder characterized by the presence of exostoses, bony 
outgrowth capped by cartilage and with the most serious 
complication of chondrosarcomas or osteosarcomas (1-3) . EXTl 
and EXT2 were cloned (7, 8) and shown to harbor mutations in 
affected members of multiple exostoses families, defining 
30 two candidates as the genes responsible for multigene family 
of proteins with potential tumour suppressor activity. 
Recently, cuiother two members of EXT-like genes, EXTLl and 
EXTL2 were also identified (9, 10) . Both genes were mapped 
to the short arm of chromosome 1, in bands lp36 and lpll-12, 
35 respectively, a region that frequently loss of 
heterozygosity in breast (12-13), gastric cancer (14), 
colorectal polyps (15) , multiple endocrine neoplasia (16) , 
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and cervical carcinoma (17) . Nevertheless, chromosome 
localization of EXTLl and EXTL2 exclude them as candidates 
for EXT3. However, EXTLl and BXTL2 may play a role in those 
cases of multiple exostoses that cannot be linked to 
5 chromosome 8, 11 or 19. It is also possible that EXTLs might 
function as tumor suppressors in an entirely different cell 
type, due to their striking difference of chromosome 
locations. Therefore, searching for additional members of 
EXTL gene family in man and other species will be very 
10 inqportcuit . 

A novel member of multiple exostoses gene family was 
previously isolated and characterized by yeast -two hybrid 
approaches from mouse, which is also a novel component of 

15 TRAP signal complex, named mTREX (mouse TRAF-interaction EXT 
protein) (11) • To identify potential coding sequences of 
human TREX, a 500bp of mouse cDNA which does not show 
homology to EXT gene family was used to screen a human adult 
brain cDNA library (Clontech) at low stringency condition, 

20 two overlapping positive clones were identified. Clone 1, 
contains an insert size of 1614bp with a partial open 
reading frame of 1590 (530 amino acids) followed by a stop 
codon and a 24bp 3' -untranslated region. Clone 2 contains an 
insert size of 1430bp with 118bp overlapping with Clone 1 at 

25 the 3*-\mtranslated region, resiilting in 2926bp of the total 
cDNA sequence. This cDNA sequence was used to search the 
GenBank using BLAST search program and demonstrated a near 
identity and overlapping with human chromosome 8 BAC clone 
CIT987SK-2A8 (HSU96629, NCBI sequence ID g2341008, briefly 

30 as BAC 8) . This clone was obtained and a conplete sequence 
determined. To obtain cDNA covering additional portions of 
the gene a PCR-based method was used. Primers were designed 
from the sequence of BAC 8. PGR of a randomly primed, Jurkat 
total RNA with these primers produced multiple, specific 

35 bands of different sizes, which were individually cloned to 
yield the cDNA clones. The longest clone contains a 1197bp 
insert. Sequencing revealed that this clone overlapped with 
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the cDNA. clone 1 from brain cDNA library by 51 nucleotides 
at the 5* direction. To extend the hTREX to a full-length 
cDNA sequence, a modification of the 3' and 5'-rapid 
airplifications of cDNA ends (RACE) were performed, producing 
5 a series of overlapping RACE products which extended the 
cDNA sequence 637 base pairs in the 5' direction and 1527 bp 
in the 3' direction. The combination of cDNA isolation from 
cDNA library, PGR extension and RACE extension resulted in 
the complete sequence of the hTREX candidate gene of 6236 

10 bp. The whole cDNA sequence was sent to GenBank (the 
accession number is AF083551 for human TREX) . The longest 
continuous coding region is 2760bp starting at nucleotide 
638, and is preceded by 6 in frame stop codons upstream. The 
predicted 5' and 3*-tintranslated region (UTR) is unusually 

15 long as corrqpared with the 5' and 3* UTR sequences which have 
been found in some proto- oncogenes as well as human 
transforming growth factor-fi (18) . 

The cDNA sequence is identical to BAC 8 which had previously 
20 been mapped to chromosome 8p. To further determine the 
finest" chromosome location of TREX, cDNA clone containing 
the whole open reading frame was purified and hybridized to 
metaphase chromosome spreads using fluorescence in situ 
hybridization (PISH) . This analysis positioned TREX on 
25 chromosome 8pll-12 (Figure 5), a region of the genome is 
frequently deleted in tumors from human squamous cell 
carcinomas of the head and neck (SCCHN) (19) , prostate 
carcinomas (20) , breast cancers (21) , papillary bladder 
cancers (22) and colon cancers (23) , and is thus believed to 
30 contain one or more tumor suppressor loci. 

To fmrther characterize the hTREX gene and to determine the 
intron/exon boxindaries for mutationa:l analysis, hTREX 
sequences were conpared to BAC 8 genomic sequences. The TREX 
35 gene totally consists of 7 exons. The exact intron and exon 
sizes have been determined. All exon-intron splice junctions 
conform to the eukaryotic 5' -donor and 3'-acceptor consensus 
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splice junction sequence GT-AG (24) (Table 1) . Of the 6 
splice junctions, 3 occurred between codons, and 3 
interrupted codons. 

5 The fact that the TREX candidate gene showed significant 
similarity with EXT gene family and mapped within the region 
deleted in a variety of tumor types, strongly suggests that 
it is therefore a novel member of the EXT gene family as 

Table 1. The sizes and junction sequences fbr exon/introns of hTREX 









Sequences A exon-inlran Junction 


No. 


EXDII 


Intron 


3^ splicing acceptor 


S'spUdngdonor 


1 


71 


11800 




AGCCGj^aggac 


2 


94 


2033 


aaatc^QAGAG 


ACATBgtgagge 


3 


2623 
121 


13035 


tttgngGCCTG 


TCATOglaatas 


4 


6187 


afaca^GTBGT 


TTCCOji^gagag 


5 


145 


5421 


ttteaagGGTGT 


ACAAG^aagaa 


6 


129 


7433 


ctsae^TATTA 


TCAAG^gaggt 


7 


3029 




tttecagOTQAC 



well as a potential candidate for several tumor phenotypes. 
10 To facilitate the search for mutations of whole open reading 
form of TREX, 5 sets of primer pairs for PGR amplification 
and 12 sequencing primers were selected from the flgmking 
intronic or extronic sequences (Table 2) . 
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TaMe 2. Primers for PCR amplificatton and Sequencing of human TREX 



Exon 3 5' forward primer 
3' reverse primer 
sequencing primers 
(fonward) 



Exon4 5' forward primer 
3' reverse primer 
sequena'ng primers 
(forward) 

Exon 5 5' foiward primer 
3' reverse primer 
sequencing primers 
(fonward) 

Exon 6 5* fonward primer 
3* reverse primer 
sequendng primers 
(fonward) 

Exon 7 5' forward primer 
3' reverse primer 
sequencing primers 
(fonward) 



5' TTATGGCGAGTGACCCGACGTG 3' 

5' TTGCTAAAGTGAAGGAAGTTGG 3' 

5* ACCCGACGTGATCTGG 3' 

5' AAGAGCTCCTGCAGCTGG 

5* TTCTCGTTGCCCTCTCAC 3" 

5' ATCATCAATCTGTCACG 3' 

5' ACTACGATGACCGGATC 3* 

5' TTCCCTACCAGGACATGC 3' 

5' AACATGGCTGACAACG 3* 

5' TATTGGTGGTGGAGCTGG 3* 

5' AATCCAGCCATGGTCTCCTTGG 3' 
5' AGTCGATGCCATTATTACCAGC 3' 
5" TTCCTTCCTCATCACAG 3" 

5' AGGTCTGTGTATGCACTTGTG 3' 
5' AGTCGATGCCATTATTACCAGC 3" 
5" TTCAAGGGTGTGGAGAG 3' 

5" TTGGCTGAAAGCCAACAACCTG 3* 
5* AACATGCACGCATCCACAGC 3' 
5' TTGTAACACAGCATGTGG 3* 

5' GGTTCTGTCAGTATTAGCTGGG 3' 
5' TTCCTCCCTCTGCTCATCCTC 3' 
5* TTCCCACTCTGTCTCTC 3' 
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Genetic alterations of TREX were further analyzed in breast 
cancers as wells as various tumors in which frequent LOHs 
were observed on 8p. A total of 315 primary tumors 
originated from a variety of organs and 14 cancer cell lines 
5 were analyzed. Mutations in the entire coding regions as 
well as surrounding intron-exon boundaries, were analyzed, 
but no somatic mutations were detected. In Case 9, a 
thyroid cancer patient, had a 9-bp insertion in her 
constitutional DNA. This 9-bp has been inserted at a direct 
10 repeat with a T as a spacer: 5 ' - GATGAGGC -T- GATGAGGC -A-3 ' 
resulting 5' -GATGAGGC-T-GATGAGGC-T- GATGAGGC -A-3' , and amino 
acid sequence would change from Asp-Glu-Ala-Asp-Glu-Ala to 
Asp-Glu-Ala-A3P"Glu-Ala « Asp-Glu-Ala . 

15 A G to A transition at the third nucleotide of codon 171 was 
also observed in one lung cancer cell line EBC-1. This base 
substitution does not change amino acid coding. Since the 
constitutional DNA of this cell line was not available, it 
is not possible to determine whether or not this base 

20 substitution occurred somatically. Although other 328 
tumors did not harbor this base substitution, the 
possibility of a rare polymorphism cannot be excluded. A C 
to T transition at codon 605 was found only in two of 329 
tumors. Again this base substitution does not affect amino 

25 acid coding. Constitutional DNAs of the patients of these 
two tximors also harbored this base substitution. 50 normal 
volunteers were also analyzed but none of them had this base 
substitution. However, this base substitution is thought to 
be a rare polymorphism rather than germline mutation. 

30 Besides these alterations, three polymorphisms were found: 
a polymorphism with no amino acid change in exon 3, at codon 
409, and two polymosrphisms in introns 4 and 5. These 
results are siimmarized in Table 3 . 
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METHODS AND MATgRIALS 

cI»GL library screening. A SOObp of cDNA insert of mouse TREX 
was purified from a digest of pBluescript DNA by agarose gel 
5 electrophoresis, Icdoeled by random priming, and used to 
screen 1X10^^ plaques of an oligo(dT) + random primed human 
adult brain cDNA library (Clontech) at reduced stringency 
condition. Inserts from the clones identified in this way 
were trcuisf erred into pBluescript plasmids. 

10 

RT-PCR csDNA extension. Total RNA prepared from Jurkat cells 
was used for in vitro transcription. About 10 ^g of total 
RNA was used as a template in a 25 /^l RT reaction containing 
40 /zg of hexamer random primers. 10 fil of RT product was 

15 then used as a template in a 100 fil PGR reaction. Thirty 
cycles of amplification (1 min at 94 **C, 1 min at 50 **C, 2 
min at 72 **C) were performed, and the products were analyzed 
on agarose gels. Products with unique sizes were produced 
from several primers. Individual products were excised from 

20 the gel, purified form QIAquick Gel Bxtracrion Kit (QIAGEN) , 
and cloned into the pCR II vector (InVitrogen) . 

3* and 5*-RACE-Ready™ cDMAs from human brain and muscle were 
obtained from Clontech. PCR reactions were perfoimed 
25 according to the manufacturer's protocol using the primers 
supplied with the cDNAs. PCR products were cloned to pCR II 
vectors as describe above. 

DNA sequencing and analysis. DNA sequences were determined 
30 using ThermoSequenase (Amersham) , oe-^P-ddNTP labeling, and 
autoradiographic detection. Complete sequences for both 
sense and antisense strands were determined for the cDNA. 
DNA and protein sequence analysis cuid database searches were 
performed using MacVector™ sequence analysis software 
35 (Osford Molicular Group) and by BLAST program. 
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Fish Analysis 

Netaphase or prophase spreads were prepared from 
phytohemagglutinin-stitnulated peripheral blood lyn5>hocytes 
S of a normal healthy female volunteer (Inazawa et al., 1994) ( 
25) . Slides were denatured at 75 °C for 3 min in 70% 
formamide/2XSSC {0.3M NaCl, 0.03M sodium citrate, pH7) , 
immersed in 70% ethanol at -20°C, and dehydrated in 100% 
ethanol. Two-color FISH, using pBSIISK{+) -TREX, a plasmid 

10 clone which contains TREX cDNA and RMC08L009 (pJM128) , a 
plasmid clone which contains chromosome 8 centromere 
sequence (Donlon et al., 1986) (26), was performed 
essentially as described previously (Inazawa et al., 1993) 
(27) . RMC08L009 was obtained from the Resource for 

15 Molecular Cytogenetics, LBNL/UCSF. Briefly, 0.5 fzg of 
pBSIISK( + ) -TREX or 0.5 /zg of RMCO8L009 was labeled with 
biotin-16-dUTP (Boehringer Mannheim GmbH, Mannheim, Germany) 
or digoxigenin-ll-dUTP (Boehringer Mannheim) by nick 
translation, respectively. The mean fragment size of the 

20 nick- translated probes was between 300 bp cuid 600 bp. DNA 
probes were precipitated with 20 fig of sonicated salmon 
sperm DNA and 20 jug of Escherichia coli tRNA. and then 
dissolved in 30 fil of formamide. The biotin- and 
digoxigenin- labeled probes were mixed at a ratio of 5/5.5 

25 (v/v) , and human Cot-1 DNA (Gibco BRL, Gaithersburg, MD) 
dissolved in formamide was added to the mixed solution at a 
concentration of 0.4 //g/^1. This mixture was heat -denatured 
at 75 ®C for 10 min and mixed with an equal volume of 
4XSSC/20% dextran sulfate, and hybridized to slides of 

30 normal metaphase or prophase chromosomes at 37®C for 2 days 
in a humid chamber. After hybridization, the slides were 
washed for 15 min sequentially with 50% formamide/2XSSC at 
37^C, 2XSSC, IXSSC, and 4XSSC at room temperature, and 
incubated in 4XSSC/1% Block Ace (Dainippon Pharmaceutical 

35 Co., Ltd., Osaka Japam) containing avidin-FITC (15 fig/wl) 
and anti-digoxigenin-rhodamine Hug/xol) Boehringer Mannheim) 
at 37 for 40 min. Slides were washed for 10 min each in 
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4XSCC, 4XSSC/0.05% Triton X-100 and 4XSSC at room 
teinperature, and for 5 min each in 2XSSC and distilled water 
at room temperature. Slides were then counterstained with 
0.15 iig/ml of 4,6-diamidino-2 phenylindole (DAPI) in an 
5 antifade solution. 

A Nikon Eclipse E800 microscope was used for visualization 
of DAPI banding patterns and the hybridization signals. 
Digital images were acquired using a COHU high performance 
10 CCD camera (San Diego, CA) controlled with Mac Probe 3.4 
softwEure (Perceptive Scientific Instrximents , Inc., Chester, 
UK) . At least 50 metaphase or prophase cells were examined 
to determine the chromosomal location of TRBX gene. 

15 Western blotting. Proteins were ' separated by 
electrophoresis in 7.5% polyacrylamide/ SDS gels, and 
electrophoretically transferred to membranes for Ih. The 
membranes were blocked in TBS (100 mM Tris, ISOniM NaCl) 
containing 10% nonfat dried milk and 0.1% Tween-20 for 2h. 

20 Incubation of the membranes with anti-TREX monoantibody was 
performed ixi TBS containing 5% nonfat milk and 0.1% Tween 20 
for Ih and then membranes were washed with TBS containing 
0.1% Tween 20 for 30 min and detected with ECL detection 
kit. 

25 

DNA and RNA preparation. All the tumor and normal tissues 
were obtained from Department of Otolaryngology, CPMC, 
Columbia University- The histopathological classification 
was as suggested by the WHO committee. Both normal and tumor 

30 tissues were collected at the time of surgery and 
snap-frozen. High molecular weight DNAs were obtained from 
the tissue by phenol -chloroform extraction and ethanol 
precipitation. Total RNAs were prepared by using TRIzol 
Reagent (6IBC0BRL) . Sections from each of the tumors were 

35 histopathologically examined. All tumor samples contained 
greater than 90% tumor cells* 
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Mutational analysis. 10 PGR primers and 12 sequencing 
primers were designed to analyze the whole ORF of TREX, A 50 
Ml reaction contained 150 ng genomic DNA, 20 pmol of each 
primer, IX Expand™ High Fidelity PGR buffer (Boehringer 
Mannheim) , and 2.6 U Expand™ High Fidelity PGR System enzyme 
mix (Boehringer Mannheim) . After an initial denaturation for 
2 min at 94 ^C, 30 cycles of 20 S at 94 ^C, 30 s at 60 <*C, 
and 3 min at 68 and final extension for 7 min at 68 <>C 

were carried out in a PGR microtube thermal Cycler (Perkin 
Elmer) . Direct sequencing of PGR products was performed 
after pre- treatment by Pre-PGR sequencing kit (Amersham) 
using the secjuencing primers as described above. All 
mutations were confirmed by sequencing a newly amplified 
product . 



wo 00/18959 



PCTAJS99/216S4 



-59- 

REFBRBNCES FOR THE SECOND SERIES OF SXPERIMENTS 

1. Schraale, Conrad, E.U. & Raskind, W.H. The 
nattiral history of hereditary multiple exostose s. J, 
Bone Joint Sura. Am . 76, 986-992 (1994). 

2. Leone, N.C. et al. Genetic heterogeneity in families 
with hereditary multiple exostoses. Am. J, hum. 
Genet. 53, 71-79 (1993). 

3. Luckert-Wichlund, C.L. et al. Natural history study 
of hereditary multiple exostoses. Am. J. Med. Genet. 
55; 43-46 (1995). 

4. Ludecke, H.J. et al. Molecular dissection of a 
contiguous gene syndrome: localization of the genes 
involved in the Langer-Giedion Synchrome. Hum. Mol. 
Genet. 4, 31-36 (1995). 

5. Wuyts, W. et al. Refinement of the multiple 
exostoses locus (EXT2) to a 3-cM interval on 
chromosome 11. Am. J. Hum. Genet . 57, 382-387 
(1995) . 

6. Le Merrer, M. et al. A gene for hereditary multiple 
exostoses maps to chromosome 19p. Hum. Mol. Genet . 
3, 717-722 (1994). 

7. Ahn, J. et al. Cloning of the putative tumour 
suppressor gene for hereditary multiple exostoses 
(EXTl) . Nature Genet , 11, 137-143 (1995) . 

8. Sticken, D. et al. The EXT2 multiple exostoses gene 
defines' a family of putative tumour suppressor 
genes. Nature Genet . 14, 25-32 (1996). 



9. 



Wise, C. et al. Identification and localization of 



wo 00/18959 



PCTAJS99/21654 



-60- 

the gene for EXTL, a third member of the multiple 
exostoses gene family. Genome Res . 7, 10-16 (1997). 

10. Wuyts, w., et al. Identification and 
5 characterization of a novel member of the EXT gene 

family, EXTL2. Eur. J. Hum. Genet . 5, 382-389 
(1997) , 

11. Kashiwada, M. et al. TREX, a Novel Gene of 
10 Hereditairy Multiple Extoses (EXT) Gene Family, 

Involved in TRAF-mediated Signaling (in press) . 

12. Hoggard, N. et al. Allelic imbalance on chromosome 
1 in human breast cancer. Microsatellite repeat 

15 analysis. Genes Chromosomes Cancer 12, 24-31 (1995) . 

13. Nagai, H. et al. Detection auid cloning of a common 
region of loss of heterozygosity at chromosome Ip in 
breast cancer. Cancer Res , 55, 1752-1757 (1995). 

20 

14. Ezaki, T., et al. Deletion mapping on chromosome Ip 
in well -differentiated gastric cancer. Br. J. Cancer 
73, 424-428 (1996) . 

25 15. Mulligan, J.M. et al. Genetic events in tumour 

initiation and progression in multiple endocrine 
neoplasia type 2. Genes Chromosomes Cancer 6,166-177 
(1993) . 

30 16. Lothe, R.A. et al. Deletion of Ip loci and 

microsatellite instability in colorectal polyps. 
Genes Chromosomes Cancer 14, 182-188 (1995) . 



17. 

35 



Zimonjic, D.B. et al. Molecular cytogenetics of 
human papillomavirus negative cervical carcinoma 
cell lines. Cancer Genet. Catoaenet. 82, 1-8 (1995). 



wo 00/18959 PCTAJS99/21654 

-61- 

18. Kazak, M. An analysis of 5'-noncoding sequences from 

699 vertebrate messenger RNAs. Nucl. Acids Rea . 15, 
8125-8148 (1987) . 

5 19. Cowan, J.M. , Beckett, M.A. , Weichselbaum, R.R. 

Chromosome changes characterizing in vitro response 
to radiation in human squamous cell carcinoma lines. 
Cancer Res , 53, 5542-5547 (1993) . 

10 20. Vocke, CD. et al. Analysis of 99 microdissected 

prostate cardinoma reveals a high frequency of 
allelic loss on chromosome 8pl2-21. Cancer Res . . 56, 
2411-2416 (1996) . 

15 21. Courjal F. et al- Mapping of DNA amplifications at 

15 chromosomal localizations in 1875 breast tumors: 
definition of phenotypic groups. Cancer Res . 57, 
4360-4367 (1997) . 

20 22. Richter J., et al. Meurked genetic differences 

between stage pTa and stage pTl papillary bladder 
cancer detected by conparative genomic 
hybridization. Cancer Res , 57, 2860-2864 (1997). 

25 23. Tanaka, K. et al. Suppression of tumorigenicity and 

invasiveness of colon carcinoma cells by 
introduction of normal chromosome 8pl2-pter. 
Oncogene . 12, 405-410 (1996) . 

30 24. Shapiro, M.B. and Senapathy, P. RNA splicing 

junctions of different classes of eukaryotes: 
sequence statistici3 and functional implications in 
e39ression. Nucleic Acids Res , 15, 7155-7174 (1987) , 



35 25. 



Inazawa J., Ariyama T., Tokina T. , Tanigami A., 
Nakamura Y., Abe T. (1994) High resolution ordering 
of DNA markers by multi-color fluorescent in situ 



wo 00/18959 



PCTAJS99/21654 



-62- 

hybridization of prophase Chromosomes. Cytogenet 
Cell Genet 65:130-135. 

26. Donlon T. , Wyman A.R., Mulholland J., Barker D., 
Bruns G., Latt S., Botstein D. (1986) Alpha 
satellite-like sequences at the centromere of 
chromosome #8 Am. J. Hum. Genet. 39: A196. 

27. Inazawa J., Saito H., Ariyama T., Abe T., Nakamura 
y. (1993) High resolution cytogenetic mapping of 342 
new cosmid markers including 43 RFLP markers on 
human chromosome 17 by fluorescence in situ 
hybridization. Genomics 17:153-162. 



wo 00/18959 



PCT/US99/21654 



-63- 

TRZHD SERIES OF EZPERIMSNTS 

Abbreviations used herein: . TNP-a, tumor necrosis factor-a; 
NF-kB, nuclear factor-KB, TRAP, tumor necrosis factor 
receptor-associated factor; PGR, polymerase chain reaction; 
5 RACE, rapid amplification of cDNA ends; PBS, phosphate- 
buffered saline; luc, lucif erase; HEK, human embryo kidney; 
HA, hemgglutinin; PMSF, phenylmethylsulfonyl fluoride; 
TRITC, trimethylrhodamineisothiocynate; EGFP, enhanced green 
fluorescent protein. 

10 

EXTL3 is a member of the EXT gene family and a putative 
tumor suppressor gene. Here we identified the cDNA encoding 
mouse homolog of EXTL3 and examined the effect of its 
expression on nuclear factor-KB (NF-kB) activity. The mouse 

15 EXTL3 protein is 97% homologous to tfie human EXTL3 • 
Northern blot analysis indicated that mouse EXTL3 is 
ubiquitously expressed in tissues, with highest expression 
in the heart, brain, and skeletal muscle. Over expression 
of EXTL3 enhanced tumor necrosis f actor-a (TNP-a) - and tumor 

20 necrosis factor receptor-associated factor 2 (TRAF2) -induced 
NF-kB activation. Structure- functional analysis revealed 
that the transmembrane region near the amino teinninus was 
required for this effect of mouse EXTL3 on NF-kB activity. 
The results of stibcellular localization studies revealed 

25 that EXTL3 was expressed predominantly at the endoplasmic 
reticulum. Interestingly, co-expression of EXTL3 with TRAF2 
facilitates to change in distribution of EXTL3 and TRAF2 
surrounded the EXTL3 -containing vesicle caused by TRAF2. 
These results strongly suggest that EXTL3 may modulate a 

30 signal Ceuscade mediated by TNF-a. 

Tumor necrosis factor a (TNF-a)^ is a potent inflammatory 
cytokine that generates two different signals: it induces 
apoptosis, and it activates the transcription factor NF-kB 
35 (1, 2) . The inhibition of NF-kB during TNF-a stimuli 
results in apoptosis in various cell lines which are 
originally resistant to TNF-a- induced cell death (3-5) . 
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Therefore, activation of NF-kB likely induces the expression 
of genes that coxinteract apoptotic signals and prevent cell 
death . 

5 Hereditary multiple exostoses syndrome (EXT) is an autosomal 
dominant disorder characterized by the formation of multiple 
cartilage-capped tumors that develop from the outgrowth 
plate of endochondral bone (6) . Genetic linkage analysis 
has mapped loci for EXT at chromosomes 8q24.1 (EXTl) (7, 8) , 

10 llpll-13 (EaCT2) (9, 10), and 19p (EXT3) (11). Both EXTl 
(12) and EXT2 (13) genes have been identified; these 
proteins share extensive sequence similarity, especially at 
the carboxyl terminus. The three EXT- like genes, EXTLl 
(14), EXTL2/EXTR2 (15, 16), and EXTL3/EXTR1 (16, 17), which 

15 also share considerable homology, have heen assigned to 
human chromosomes lp36.1, lp21, 8p21, respectively. Because 
these chromosomal regions have been associated with high 
frequent loss of heterozygosity in various humsin cancers, it 
has been thought that putative tumor suppressor genes exist 

20 in these loci (18-20) . Therefore, the EXT family including 
EXTL3 may represent a class of putative tiunor suppressors. 

Recently, EXTl and EXT 2 were identified as 
glycosyltransferases required for biosynthesis of heparin 
25 sulfate (21, 22). However, functional role to another 
member of the family is still not defined. Here we report 
that mouse EXTL3 affects NF-kB activity stimulated by TNF-a. 
We also describe the subcellular localization of this 
protein at the endoplasmic reticulum. 

30 

MATERIALS MtD METHODS 

Materials • Recombinant human tumor necrosis factor-a (TNF- 
Qf) was obtained from R&D Systems, Inc. (Minneapolis, MN) . 
TRITC-conjugated concanavalin A was obtained from Sigma (St. 
35 Louis, MO) . Fetal calf serum (FCS) was obtained from 
HyClone (Logan, UT) . The NF-KB-dependent reporter gene 
construct pELAM-luc, in which the human E-selectin promoter 
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region (-730/+52) has been inserted into pGL3 by using 
Sacl/Bglll sites, was kindly provided by MBL (Nagoya, 
Japan) . 

5 cDNA cloning of mouse EXTL3 . Mouse EXTL3 cDNA was isolated 
from the Mouse Brain 5'-Strech Plus cDNA library (Clontech, 
California, CA) by using human EXTL3 as a probe. To extend 
the partial sequence, RACE was carried out as described in 
the manufacturer's manual (Clontech) . 

10 

Northern blot analysis. A Northern blot filter containing 
mouse poly (A) + RNAs from eight different tissues was 
purchased from Clontech, The filter was hybridized with the 
1.2 kb EXTL3 cDNA fragment that contains the entire open 
15 reading frame as reconstructed from the RACE product. 

Plasmid construction and transf ection • To construct the 
expression plasmid, we PCR-amplif ied the full length EXTL3 
cDNA fragment by using the forward primer ( 5 ' - 

20 CGCGGATCCACCATGACAGGCTATACCATGTTGCGGA-3') , which contains a 
BamHI ' site, and the reverse primer (5'- 
CCCAAGCTTTAGATGAACTTGAAGCACTTGGT-3'), which contains a 
Hind u I site* To construct the deletion mutant lacking the 
N-terminal region (AN), the AN fragment was amplified by 

25 using the forward primer (5'- 

CGCGGATCCACCATGTCCTACAAGGAGCTGATGGCCCA-3' ) and the reverse 
primer used for the full-length fragment. To construct the 
deletion mutant lacking the c- terminal region (AC) , the AC 
fragment was amplified by using the reverse primer 5'- 

30 CCCAAGCTTGCTACCTCTTCCCGQATGGGAGCA-3' and the same forward 
primer as that for the full-length fragment. For the 
deletion mutctnt lacking both the N- and C-terminal portions 
(N&C) , the AN&C fragment was amplified by using the same 
forward primer as that for the AN fragment and the reverse 

35 primer used to generate the AC fragment. After digestion 
with BatrHI and Hindi! I, full-length and truncated EXTL3 PCR 
products were ligated into pcDNA3 -1 (-) /Myc-His B 
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(invitrogen, Carlsbad, CA) such that the myc epitope tag and 
the 6xhis tag were in-frame for subsequent translation. 

For construction of EGPP- tagged EXTL3 es^ression plasmids, 
the full-length coding region for mouse EXTL3 and the AN 
region was PCR-amplif ied by using the forward primer 5'- 
CCCAAGCTTACCATGACAGGCTATACCATGTTGCGGA-3 ' and the reverse 
primer used for the full-length fragment described 
previously. In addition, the AN region was generated by 
using the forward primer 5"- 

CCCAAGCTTACCATGTCCTACAAGGAGCTGATGGCCCA and the same 

reverse primer used for the full-length fragment. After 
digestion with Hindlll, the full-length and AN EXTL3 PGR 
products were ligated into pEGFP-N2 (Clontech) such that 
EGFP was in- frame for subsequent translation. 

Full-length coding regions of mouse TRAF2 and TRAF3 were 
amplified by PGR and subcloned into FLAG-tagged pCR3.1 
(Invitrogen) . Full-length coding regions of human TRAF2 
were amplified cuid siibcloned into hemagglutinin (HA) -tagged 
pcDNA3 (Invitrogen) . 

Cellculture and trans fection. Human embryo kidney 293 
{HEK293) cells were maintained in Eagle's minimum essential 
medium containing 10% fetal calf serxam, 100 U/mL penicillin, 
and 100 ug/xtiL streptomycin (GIBCO-BRL, Grand Island, NY) . 
For experiments, HEK293 cells were seeded at a density of 10* 
cells/dish in 10 -cm culture dishes and were cultured for 3 
days. Then, the cells were transfected by standard calcium 
phosphate co -precipitation method using commercial solution 
(Sprime 3prime inc.). 

Preparation of nuclear extracts* For nuclear extracts, 
cells were treated with or without TNF-a (20 ng/mL) for 1 h, 
washed with ice-cold PBS, and detached by using 5 mM EDTA in 
PBS. After pelleting, the cells were resuspended in wash 
buffer (10 raM Tris-HCl tpH 7.5], 130 roM NaCl, 5 mM KCl, 8 mM 
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MgClg, then pelleted and resuspended in hypotonic buffer (20 
mM HEPBS-KOH [pH 7.91, 5 mM KCl, 0.5 mM MgClg, 0.5 mM DTT, 
0.5 iriM PMSF) . After incubation for 10 min on ice, the cell 
suspension was homogeixized by using five strokes in a Dounce 
5 homogenizer. The homogenate was centrifuged for 10 min at 
4000 rpm. Sedimented nuclei were resuspended in extraction 
buffer (20 mM HEPES-KOH [pH7.9], 25% glycerol, 500 mM NaCl, 
1-5 mM MgClj, 0.2 mM EDTA, 0.5 mM DTT, 0.5 mM PMSF, 0.5 fig/xoL 
pepstatin A, 1.3 /xq/ml spermidine) and broken by using five 

10 strokes in a Dounce homogenizer. After vortexing for 1 h, 
the nuclear suspension was centrifuged for 10 min at 15,000 
rpm. The supernatant was dialyzed against binding buffer 
(20 mM HEPBS-KOH [pH 7.9], 10* glycerol, 50 mM KCl, 1.5 itiM 
MgClg, 0.5 mM EDTA, 0.5 mM DTT, 0.5 niM PMSF). After 

15 centrifugation, the supernatant was used as the nuclear 
extract . 

Electrophoretic mobility shift assays. Electrophoretic 
mobility shift assays were performed by incubating an 

20 aliquot of nuclear extract containing 5 U3 protein with 2 jig 
poly(dl-dC) Amersham Pharmacia, Uppsala, Sweden) in assay 
buffer (13mM HEPES [pH 7.8], 50 mM KCl, 4.3 mM MgCl^, 10% 
glycerol, 0.3 mM DTT, 0.3 mM PMSF [final volume, 30 fi ll ) . 
The binding reaction was started by adding endo- labeled NF- 

25 KB-specific oligonucleotide (Promega, Madison, WI) with 
[y^^PlATP (Amersham Pharmacia) and T4 polynucleotide kinase 
and the reaction mixture was incubated for 30 min at room 
teirqperature . The samples were separated by polyacrylamide 
gel electrophoresis in low ionic-strength buffer (0.25xTriB- 

30 borate-EDTA) . Activated NF-kB complexes were identified by 
using super-shift analysis with an antibody that recognizes 
NF-kB subunit (Santa Cruz, California, CA) . 

Luciferase assay. For a reporter gene assay, HEK293 cells 
35 were transfected with 500 ng of the NF-kB -dependent reporter 
gene construct pELAM-luc, 500 ng of the internal control 
construct pRL-TK (Promega) and 10 jxg of each expression 
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construct needed. DNA concentrations Were kept constant by 
supplementation with enpty vector. Cells were lysed 24 h 
after transfection, and reporter gene activity was 
determined by using the Dual lucif erase assay system 
5 (Promega) . Luminescence was measured in a Lumat LB 9507 
(BERTHOLD GmbH &. Co. KG, Bad Wildbad, Germany) . 

Fluorescence microscopy. HEK293 cells cultured on cover 
glasses were transfected with the EGFP- tagged EXTL3 

10 construct and the FLAG- tagged TRAFs constructs by a standard 
calcium phosphate co-precipitation method. The cells were 
fixed with 3.7% formalin in PBS for 10 min at room 
temperature 24 h after transfection. The cells were washed 
three times with PBS and treated with 0.2% Triton X-100 in 

15 PBS for 5 min, followed by a 30 min incubation in blocking 
solution (PBS containing 5% BSA) . After blocking, the cells 
were incubated with 100 jxg/mL TRITC- conjugated concanavalin 
A for 30 min. The cells were washed three times with PBS 
and then incubated with M2 anti-FLAG monoclonal antibody 

20 (Sigma) at 20 jig/ml in 0.1% BSA in PBS for 1 h. Cells were 
washed three times with PBS then inciibated with Cy5- 
conjugated anti-mouse IgG antibody (Jackson ImmunoRe search 
Laboratories, Inc., West Grove, PA) at 20 ^g/ml in 0.1% BSA 
and 0.1% Tween 20 in PBS for 1 h. The cells were then 

25 washed with PBS and moimted on slide glasses. Fluorescence 
was visualized by using a Carl Zeiss LSM510 confocal laser 
scanning microscope (Oberkochen, Germany) . 

Accession Number. The Genbank accession number for mouse 
30 EXTL3 is AF083550. 
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RESULTS 

Cloning of murine EXTLS dESSSm and distribution of Its mRMA in 
various tissues. From the mouse brain cDNA library, several 
colonies were selected by using htrnian EXTL3 cDNA. as a probe. 
5 To extend the partial sequence, RACE were carried out as 
described in the manufacturer's manual. An open reading 
frame encoding a predicted protein of 918 amino acids was 
obtained. Mouse EXTL3 protein is 97% homologous to the 
human protein (Fig. 9A) . 

10 

A Northern blot filter containing mouse poly (A) + RNAs from 
eight different tissues was hybridized with a 1.2 kb 
fragment of mouse EXTLS cDNA. A single transcript of 6.0 kb 
was detected in all tissues examined, with highest 
15 expression in heart, brain, and skeletal muscle (Fig. 9B) . 
The results are consistent with those associated with human 
EXTL3. 

Effect of EXTL3 protein expression on NF-kB activity* To 

20 investigate the effects of EXTL3 on TNF-cy- induced NF-kB 
activation, an electrophoretic mobility shift assay was 
carried out. NF-kB activation was detected in the nuclear 
extract stimulated by TNF-a (Fig. lOA) . The super shift of 
the band with anti-NP-kB p50 subunit antibody or anti-NF-kB 

25 p65 subunit antibody was observed. These results might 
indicate that the p65/p50 heterodimer was formed in TNF-of- 
treated HEK293 cells. In EXTL3-transfected cells, TNF-Of- 
induced NF-kB activation was enhanced markedly (Fig. lOA) . 
To confirm this finding, we also examined the effect of 

30 EXTL3 on NF-kB activation by using a luciferase assay. Over 
expression of EXTL3 enhanced TNF-a- induced NF-kB activation 
in a concentration- dependent manner (Fig. lOB) . Similar 
results were obtained when EXTL3 was co-e35)ressed with TRAF2 
(Fig. IOC) . 

35 



EXTL3 has a putative transmembrane region at its N-terminus 
and the EXT domain at its C-terminus (Fig.llA) . The EXT 
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domain comprises two subdomains, EXT-N and EXT-C. To 
determine the region necessary for the enhancement of NF-kB 
activation, we constructed a series of EXTL3 deletion 
mutants and investigated their effect on NF-kB activation- 
5 The results revealed that enhancement of NF-kB activation 
was not detected in N-terminal trtmcated EXTL3 expressed 
HEK293 cells, but the C- terminal truncation mutant enhanced 
NF-kB activation (Fig. IIB and IIC) . These results showed 
that the transmembrane region closer to the N-terminus was 
10 required for modulation of NF-kB activation induced by TNF-a 
or TRAF2. 

Cellular location o£ SXTL3 protein. To determine the 
subcellular localization of EXTL3, HEK293 cells were. 

15 transiently transfected with the BGFP-tagged EXTL3 
expression plasmid. As shown in Fig. llD-b, EXTL3 protein 
is detected at the endoplasmic reticulum. By contrast, the 
localization pattern of the N-terminal deletion mutant is 
similar to that of EGFP (Fig. llD-a and llD-c) , These 

20 results suggested that the transmembrane region closer to 
the N-terminus is necessary for pre-nuclear localization. 

To elucidate the role of the EXTL3 protein in TNF^of 
signaling, we examined the effects of TR7^2 and TRAF3 on the 
subcellular distribution of EXTL3. Although no change in 

25 EXTL3 localization was observed in HEK293 cells co- 
transfected with TRAF3, TRAF2 affected the subcellular 
distribution of EXTL3 (Fig. 12) . TRAF2 caused the formation 
of vesicles containing EXTL3. As shown in Fig. 12H, the 
EXTL3 localization and the region stained with TRITG- 

30 conjugated concanavalin A clearly overlap. This result is 
consistent with localization of EXTL3 at the endoplasmic 
reticulum. However, EXTL3 -containing vesicles appeared in 
cells co-expressing TRAF2 cells that were not stained with 
concanavalin A (Fig. 12D) . Interestingly, TRAF2 existed at 

35 the surface of these vesicles. 

DISCUSSION 
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In the present study, we demonstrate that EXTL3 markedly 
enhauices both TNF-a- and TRAF2 -induced NF-kB activation, 
although EXTL3 slightly stimulates NF-kB activity in itself. 
The study using EXTL3 truncation mutants demonstrates that 
5 the N~ terminal region containing a putative transmembrane 
domain is required for EXTL3 -associated enhancement of NF- 
kB. Indeed, EXTL3 locates at endoplasmic reticulum, which 
consists with prediction based on the amino acid sequence 
(17) . Therefore, the correct sorting of EXTL3 may be 
10 necessary for the enhancement of TNF-a- and TRAF2- induced 
NF-kB activation. 

Previous studies demonstrated that several TRAFs associate 
with the TNF receptor and initiate signal transduction. 

15 TRAF2, but not TRAF3, is responsible for the activation of 
NF-kB (23). We demonstrated that EXTL3 -contented, vesicles 
appear in TRAF2 co-transf ected cells but not in TRAF3 co- 
transfected cells. Moreover, TRAF2 exists on the surface of 
these vesicles. These also inplicate EIXTL3 in TNF -a -induced 

20 signal transduction. Recently, numerous protein mediating 
signals initiated by TNF-a have been identified (24) , There 
is a possibility that EXTL3 affects the function of these 
proteins such as TRAF2. Several groups reported that the 
activation of NF-kB prevents apoptosis (3-5) . Here, we 

25 report that BXTL3 may involved in the TNF- a- induced NF-kB 
activating pathway, which may help to understand the tumor 
suppressor activity of EXTL3. 

Heparin sulfate proteoglycans are ubiquitously present on 
30 the cell surface and in the extracellular matrix. Heparin 
sulfate chains interact with a variety of proteins and are 
therefore implicated not only in various cellular responses 
but also in diverse physiological phenomena (25) . The role 
of glycosaminoglycan in the transmembrane signaling induced 
35 by fibroblast growth factor is well documented (28-30) - 
Recently, it has been reported that EXTl and EXT 2 encode 
glycosyltraneferases involved in the chain-elongation step 
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of heparin sulfate (21, 22) . Therefore, another member of 
EXT family, perhaps EXTL3, also may be involved in 
glycosaminoglycan synthesis- Indeed, EXTL3 localizes to the 
endoplasmic reticulum, as EXTl does (21, 26) . Beside this, 
5 TNF-a has an affinity for heparin (27) . These let us 
speculate that glycosaminoglycan may play a pivotal role in 
TNF-ce- induced signal transduction as well as in fibroblast 
growth factor- induced signaling, but further studies are 
required to confirm our hypothesis. 

10 

The chromosomal localization of EXTL3 has been assigned to 
8p21 (16, 17, 31) and the EXTL3 gene was mapped in the 
common region of deletion in primary breast cancer (31) . 
The extensive mutation search was performed using the 329 

15 primcury humeui cancers including chondrosarcomas, breast and 
liing cancers and the results revealed that the frecpient 
somatic mutation was not detected in the sporadic human 
cancers (31d) , suggesting that EXTL3 may not be involved in 
tumor development and/or progression. However, loss of 

20 hetrozygosity in the EXTL3 gene may cause unbalance of the 
regulation of NF-kB activation by TNFR-mediated signal 
transduction and eventually its loss of EXTL3 function may 
contribute to inhibition of apoptosis in primary human 
cancers. Further studies will be necessary to better 

25 underst2uidings of association between BXTL3 function and 
tumor development and/ or progression. 
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What is claimed is: 

1. An isolated nucleic acid molecule encoding a Tumor 
necrosis factor Receptor-Associated Factor (TRAP) 
protein- interacting hereditary multiple extoses (TREX) 
protein. 

2. The isolated nucleic acid molecule of claim 1, wherein 
the nucleic acid molecule is a DNA molecule. 

3 - The isolated DNA molecule of claim 2 , wherein the DNA 
molecule is a cDNA molecule. 

4. The isolated DNA molecule of claim 2, wherein the DNA 
molecule is a genomic DNA molecule. 

5- The isolated nucleic acid of claim 1, wherein the nucleic 
acid molecule is an RNA molecule. 

6. The isolated nucleic acid molecule of claim 1, wherein 
the nucleic acid molecule encodes a mammalian Tumor 
necrosis factor Receptor-Associated Factor (TRAP) 
protein- interacting hereditary multiple extoses (TREX) 
protein. 

7. The isolated nucleic acid molecule of claim 1, wherein 
the mammalian Tumor necrosis factor Receptor-Associated 
Factor (TRAP) protein- interacting hereditary multiple 
extoses (TREX) protein is a mouse, rat, or human Tumor 
necrosis factor Receptor-Associated Factor (TRAP) 
protein- interacting hereditary multiple extoses (TREX) 
protein . 

8. The isolated nucleic acid molecule of claim 6, wherein 
the nucleic acid molecule encodes a Tumor necrosis factor 
Receptor -Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein comprising an 
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amino acid secjuence as set forth in Figure 7B (SEQ ID 
N0:2) . 

9. The isolated nucleic acid molecule of claim 8, wherein 
5 the amino acid sequence comprises an isbleucine zipper 

motif and a hereditaiy multiple extoses C (EXT C) domain. 

10. The isolated nucleic acid molecule of claim 6, 
wherein the nucleic acid molecule encodes a Tumor 

10 necrosis factor Receptor-Associated Factor (TRAP) 

protein- interacting hereditary multiple extoses 
(TREX) protein, wherein the Tumor necrosis factor 
Receptor-Associated Factor (TRAP) protein- 
interacting hereditary multiple extoses (TREX) 
15 protein has substantially the same amino acid 

sequence as set forth in Figures 7B (SEQ ID NO: 2) . 

The isolated nucleic acid molecule of claim 6, 
wherein the nucleic acid molecule encodes a Tumor 
necrosis factor Receptor-Associated Factor (TRAF) 
protein- interacting hereditairy multiple extoses 
(TREX) protein, wherein the Tumor necrosis factor 
Receptor -Associated Factor (TRAF) protein- 
interacting hereditary multiple extoses (TREX) 
protein has the amino acid sequence as set forth in 
Figure 7B (SEQ ID NO: 2) . 

The isolated nucleic acid molecule of claim 6, 
wherein the nucleic acid molecule encodes a Tumor 
necrosis factor Receptor-Associated Factor (TRAF) 
protein- interacting hereditary multiple extoses 
(TREX) protein comprising an amino acid sequence as 
set forth in Figure 8B (SEQ ID NO: 4) . 

35 13. The isolated nucleic acid molecule of claim 12, 

wherein the amino acid sequence cotiprises an 
isoleucine zipper motif and a hereditary multiple 



11, 

20 
25 



12. 

30 
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extoses C (EXT C) domain. 

14. The isolated nucleic acid molecule of claim 6, 
wherein the nucleic acid molecule encodes a Tumor 

5 necrosis factor Receptor-Associated Factor (TRAF) 

protein- interacting hereditary multiple extoses 
(TREX) protein, wherein the Tumor necrosis factor 
Receptor-Associated Factor (TRAF) protein- 
interacting hereditary multiple extoses (TREX) 
10 protein has substantially the same amino acid 

sequence as set forth in Figure 8B (SEQ ID N0:4) . 

15. The isolated nucleic acid molecule of claim 6, 
wherein the nucleic acid molecule encodes a Tumor 

15 necrosis factor Receptor-Associated Factor (TRAF) 

protein- interacting hereditary multiple . extoses 
(TREX) protein, wherein the Tuiror necrosis factor 
Receptor -Associated Factor (TRAF) protein- 
interacting hereditary multiple extoses (TREX) 

20 protein has the amino acid sequence as set forth in 

Figure 8B (SEQ ID NO: 4) . 

16. An isolated nucleic acid molecule encoding a mutant 
homolog of the mammalian Tumor necrosis factor 

25 Receptor-Associated Factor (TRAF) protein- 

interacting hereditary multiple extoses (TREX) 
protein whose genetic alteration is set forth in 
Table 3. 

30 17. The isolated nucleic acid molecule of claim 12, 

which is a deletion mutant. 

18. The deletion mutant of claim 17, wherein the encoded 
mutant homolog comprises a tumor suppressor locus. 

35 

19. The deletion mutant of claim 17, wherein the encoded 
mutant homolog does not comprise a tumor suppressor 
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locus domain. 



20. The isolated nucleic acid molecule of claim 6, 
wherein the mammalian TREX comprises a mouse nucleic 

5 acid sequence set forth in Figure 7A (SEQ ID N0:1) . 

21. The isolated nucleic acid molecule of claim 6, 
wherein the mammalian TREX comprises a human nucleic 
acid sequence set forth in Figure 8A (SEQ ID N0:3) . 

10 

22 . A vector comprising the nucleic acid molecule of 
claim 1. 

23. The vector of claim 22 adapted for expression in a 
15 host cell which comprises the regulatory elements 

necessary for expression of the nucleic acid 
molecule in the host cell operatively linked to the 
nucleic acid molecule encoding the Tumor necrosis 
factor Receptor-Associated Factor (TRAF) protein- 
20 interacting hereditary multiple extoses (TREX) 

protein so as to permit expression of the TREX 
protein . 

24. The vector of claim 23, wherein the host cell is a 
25 eukaryotic, bacterial, insect or yeast cell. 

25. Uie vector of claim 24, wherein the eukaryotic host 
cell is a mammalian cell. 



30 26. The vector of claim 25, wherein the vector is a 

plasmid. 

27. A vector comprising the nucleic acid molecule of 
claim 3 . 

35 

28. The vector of claim 27 adapted for expression in a 
host cell which coirprises the regulatory elements 
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necessary for expression of the nucleic acid 
molecule in the host cell operatively linked to the 
nucleic acid molecule encoding the Tumor necrosis 
factor Receptor -Associated Factor (TRAP) protein- 
interacting hereditary multiple extoses (TREX) 
protein as to permit expression of the TREX protein. 

29- The vector of claim 28, wherein the host cell is a 

eukaryotic, bacterial, insect or yeast cell. 

30. The vector of claim 29, wherein the eukaryotic host 

cell is a mammalian cell. 



31. The vector of claim 30, wherein the vector is a 
plasmid. 

32. A method of producing a host cell operatively linked 
to the nucleic acid molecule encoding a Tumor 
necrosis factor Receptor-Associated Factor (TRAF) 
protein- interacting hereditary multiple extoses 
(TREX) protein, which comprises growing a host cell 
comprising the vector of claim 29 under suitable 
conditions permitting production of the TREX protein 

and recovering the TREX protein so produced. 

33. The method of claim 32, further conprising purifying 
the recovered TREX protein - 



34. A method of producing a polypeptide having the 

biological activity of a protein encoded by the 
nucleic acid molecule encoding a Tumor necrosis 
factor Receptor-Associated Factor (TRAF) protein- 
interacting hereditary multiple extoses (TREX) 
protein which comprises growing the host cells of 
claim 29 under suitable conditions permitting 
production of the polypeptide and recovering the 
polypeptide so produced. 
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35. The method of claim 34, further comprising purifying 

the recovered polypeptide. 

36- A purified raammalicoi Tumor necrosis factor Receptor- 

Associated Factor (TRAP) protein- interacting 
hereditary multiple extoses (TREX) protein. 

37. The purified mammalicoi Tumor necrosis factor 
Receptor-Associated Factor (TRAP) protein- 
interacting hereditary multiple extoses (TREX) 
protein of claim 36 which is a human TREX protein. 

38 . A protein comprising substantially the amino acid 
sequence set forth in Figure 7A. 

39. A protein comprising substantially the amino acid 
sequence set forth in Figure 8A. 

40. An oligonucleotide comprising a nucleic acid 
molecule of at least 15 contiguous nucleotides 
capable of specifically hybridizing with a unique 
sequence included within the sequence of the 
isolated nucleic acid molecule encoding a Tumor 
necrosis factor Receptor- Associated Factor (TRAF) 
protein- interacting hereditary multiple extoses 
(TREX) protein of claim 1. 

41. The oligonucleotide of claim 40, wherein the nucleic 
acid is DNA. 

42. The oligonucleotide of claim 40, wherein the nucleic 
acid is RNA. 

43 . An antisense oligonucleotide comprising a sequence 
capable of specifically hybridizing with a unique 
sequence included within the mRNA molecule of claim 
5. 
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44. An antisense oligonucleotide comprising a sequence 
capcLble of specifically hybridizing with a unique 
sequence included within the genomic DNA molecule of 
claim 4. 

45. An antdLbody capable of binding to the protein of any 
of claims 36, 37, 38 and 39, 

46. An antibody capable of binding to the protein of any 
of claims 36, 37, 38 and 39, wherein the antibody is 
a monoclonal emtibody. 

47. An antibody capable of binding to the protein of any 
of claims 36, 37, 38 and 39, wherein the antibody is 
a polyclonal antibody. 

48. A monoclonal antibody directed to an epitope of a 
Tumor necrosis factor Receptor-Associated Factor 
(TRAF) protein- interacting hereditary multiple 
extoses (TREX) protein. 

49. A method of inhibiting TREX protein interaction with 
a TRAF protein comprising administering a ligand 
comprising an amino acid domain which binds to a EXT 
C domain of the TREX protein so as to inhibit 
binding of the TREX protein to the TRAF protein - 

50. A method of inhibiting overexpression of TREX 
protein comprising administering the antisense 
oligonucleotide of claim 43 which binds to an mRNA 
molecule encoding a human Tumor necrosis factor 
Receptor -Associated Factor (TRAF) protein- 
interacting hereditary multiple extoses (TREX) 
protein so as to inhibit overexpression of the human 
TREX protein. 

51. The method of claim 50, wherein inhibiting 
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overexpression of TREX protein thereby inhibits 
TRT^-induced CD40 signal dependent NF-kB activation. 

52. The method o£ claim 49, wherein the ligand is an 
antibody capable of binding to the TREX protein. 

53 . The method of claim 52 , wherein the antibody is a 
monoclonal or a polyclonal antibody. 

54- A method of inhibiting growth of a tumor cell 

comprising blocking a TRAP interacting site of a 
TREX protein by administering a ligEmd capable of 
binding to the TRAP interacting site of a TREX 
protein. 

55. The method of claim 54, wherein the TRAP interacting 

site is a hereditary multiple extoses C (EXT C) 
domain . 



56. The method of claim 55, wherein the tumor cell 
growth is inhibited in vivo or in vitro . 

57. The method of claim 56, wherein the ligand is an 
antibody capable of binding to the TRAP interacting 
site of a TREX protein. 

58. The method of claim 57, wherein the antibody is a 
monoclonal or a polyclonal antibody. 

59. A pharmaceutical conposition comprising an amount of 
the oligonucleotide of any one of claims 40, 41, 42, 
43, or 44, effective to prevent overexpression of a 
TREX protein and a pharmaceutically acceptable 
carrier capable of passing through a cell membrane. 



60. 



A pharmaceutical conposition conprising an amount of 
the antibody of any one of claims 45, 46 or 47 
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effective to block binding of a TREX protein to a 
TRAF protein and a pharmaceutically acceptable 
carrier capable of passing through a cell membrane. 

61- A method of treating an abnormality in a siibject, 

wherein the abnormality is alleviated by the 
inhibition of binding of a TREX protein and a TRAF 
protein which conprises administering to the subject 
an effective amount of the pharmaceutical 
composition of claim 60 effective to block binding 
of the TREX protein cuid the TRAF protein in the 
subject, thereby treating the abnormality in the 
subject. 

62. The method of claim 61 , wherein the TRAF protein is 
TRAF2, TRAF3 or TRAF 5. 

63. The method of claim 62, wherein the abnormality is 
cancer, a hereditary multiple extosis or an 
autoimmune disease. 

64. The method of claim 63, wherein the cancer is colon 
cancer, gastric cancer, human head and neck sqpiamous 
cell carcinoma, prostate carcinoma, breast cancer, 
thyroid cancer, esophageal cancer, lung cancer, 
colorectal cancer, ovarian cancer, papillary bladder 
cancer, osteosarcoma, chondrosarcoma, liposarcoma, 
giant cell tumor, Ewing sarcoma, or other malignant 
tumors . 

65. A method of treating an abnormality in a siibject, 
wherein the abnormality is alleviated by the 
inhibition of overesqpression of a TREX protein which 
comprises administering to the subject an effective 
amount of the pharmaceutical composition of claim 53 
effective to inhibit overexpression of the TREX 
protein, thereby treating the abnormality in the 
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sub j ect . 

66. The method of claim 65, wherein the abnormality is 
cancer, a hereditary multiple extosis or an 

5 autoimmune disease. 

67. The method of claim 66, wherein the cancer is colon 
cancer, gastric cancer, human head and neck squamous 
cell carcinoma, prostate carcinoma, breast cancer, 

10 thyroid cancer, esophageal cancer, lung cancer, 

colorectal cancer, ovarian cancer, papillary bladder 
ccuicer, osteosarcoma, chondrosarcoma, liposarcoma, 
gicuit cell tirnior, Ewing sarcoma, or other malignant 
tixmors . 

15 

68. A method of screening for a chemical compound which 
inhibits TREX protein and TRAP protein binding 
comprising: 

(a) incubating the chemical compoimd with a TREX protein 
20 and a TRAP protein; 

(b) contacting the incubate of step (a) with an affinity 
meditim imder conditions so as to bind a TREX 
protein-TRAF protein complex, if such a complex 
forms; and 

25 (c) measuring the amount of the TREX protein-TRAF 

protein conplex formed in step (b) so as to 
determine whether the conpound is capcd3le of 
interfering with the formation of the complex 
between the TREX protein-TRAF protein. 

30 

69. The method of claim 68, wherein the TRAP is a TRAF2, 
TRAF3 or a TRAF 5. 

70. The method^TDf claim 69, wherein the compound is a 
35 CID40 receptor ligand. 

71. The method of claim 69, wherein the molecule is a 
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peptide or a fragment thereof which conqprises a TRAP 
binding domain. 

72, The method of claim 71, wherein the TRAP is a TRAF2, 

TRAF3 or a TRAP 5. 



73, A method of preventing inhibition of a CD40 signal- 
dependent NF-kB activation cotiprising administering 
the antisense oligonucleotide of claim 37 which 
binds to an mRNA molecule encoding a human Tumor 
necrosis factor Receptor -Associated Factor (TRAP) 
protein- interacting hereditary multiple extoses 
(TREX) protein so as to prevent inhibition of 
activation of CD40 signal -dependent NP-kB 
activation. 

74. A method of preventing inhibition of a CD40 signal - 
dependent NF-kB activation comprising administering 
a ligand comprising an amino acid domain which binds 
to a EXT C domain of the TREX protein so as to 
inhibit binding of the TREX protein to the TRAP 
protein, thereby preventing inhibition of a CD40 
signal -dependent NF-kB activation. 



75. The method of claim 74, wherein the ligand is 
peptide or a fragment thereof which comprises a TRAP 
binding domain. 

76. A method of preventing upregulation of a TNF 
receptor typell signal -dependent NF-kB activation 
comprising administering the antisense 
oligonucleotide of claim 37 which binds to an mRNA 
molecule encoding a human Tumor necrosis factor 
Receptor-Associated Factor (TRAP) protein- 
interacting hereditary multiple extoses (TREX) 
protein so as prevent upregulation of a TNF receptor 
typell signal -dependent NF-kB activation. 
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77. A method of preventing upregulation of activation of 
a TNF receptor typell-signal-dependent NF-fcB 
comprising administering a ligand conprising an 
amino acid domain which binds to a EXT C domain of 
the TREX protein so as to inhibit binding of the 
TREX protein to the TRAF protein, thereby preventing 
upregulation of activation of a TNF receptor typell- 
signal -dependent NF-kB. 

78. The method of claim 77, wherein the ligand is 
peptide or a fragment thereof which comprises a TRAF 
binding domain. 

79. A method of detecting a predisposition to cetncer 
\rtiich comprises detecting of a mutation in a nucleic 
acid encoding TREX protein in the sanple from the 
subject. 

80. The method of claim 79, wherein the mutation is a 
silent point mutation or a missense point mutation- 
al . The method of claim 79, wherein the mutation in the 

nucleic acid encoding TREX protein is detected by 
contacting the nucleic acid from the sample with a 
TREX nucleic acid probe under conditions permitting 
the TREX nucleic acid probe to hybridize with the 
nucleic acid from the sample i thereby detecting the 
mutation in the nucleic acid encoding TREX protein 
in the sample. 

82. The method of claim 81, wherein the cancer is colon 

cancer, gastric cancer, human head and neck squamous 
cell carcinoma, prostate carcinoma, breast cancer, 
thyroid cancer, esophageal cancer, lung cancer, 
colorectal cancer, ovarian cancer, papillary bladder 
cancer , osteosarcoma , chondrosarcoma , 1 iposarcoma , 
giant cell tumor. Swing sarcoma, or other malignant 
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tumors . 

83. The method of claim 81, wherein the TREX nucleic 

acid probe comprises a nucleic acid molecule of at 
least 15 nucleotides which specifically hybridizes 
with a iHiique sequence included within the sequence 
of an isolated nucleic acid molecule encoding a 
Tumor necrosis factor Receptor-TVssociated Factor 
(TRAP) protein- interacting hereditary multiple 
extoses (TREX) protein. 



84. The TREX nucleic acid probe of claim 81, %dierein the 
nucleic acid is DNA. 

85. The TREX nucleic acid probe of claim 81, wherein the 
nucleic acid is RNA. 

86. A TREX nucleic acid probe comprising a sequence 
capable of specifically hybridizing with a unique 
sequence included within the DNA molecule of claim 
2. 



87. A TREX nucleic acid probe comprising a sequence 

caped>le of specifically hybridizing with a unique 
sequence included within the inRNA molecule of claim 
5. 



88. . The TREX nucleic acid probe comprising a sequence 

capable of specifically hybridizing with a unique 
sequence included within the genomic DNA molecule of 
claim 4. 

89. The method of claim 79, wherein the mutation 
comprises a portion of a tumor suppressor locus. 



90. 



The method of diagnosing cancer in a siibject which 
comprises: 
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a) obtaining DICV from the sanple of a siibject suffering 
from cancer; 

b) performing a restriction digest of the DNA with a 
panel of restriction enzymes; 

c) separating the resulting DNA fragments by size 
fractionation; 

d) . contacting the resulting DNA fragments with a 

nucleic acid probe capable of specifically 
hybridizing with a unique sequence included within 
the sequence of a genetic alteration of a nucleic 
acid molecule encoding a TREX protein, wherein the 
nucleic acid is labeled with a detectable marker; 

e) detecting labeled bands which have hybridized to the 
nucleic acid probe in step (d) , wherein the sequence 
of a genetic alteration of a nucleic acid molecule 
encoding a TREX protein creates a unique band 
pattern specific to the DNA of subjects suffering 
from cemcer; 

f) preparing DNA obtained from a sample of a subject 
for diagnosis by steps (a-e) ; and 

g) coTiparing the detected band pattern specific to the 
DNA obtained from a sample of subjects suffering 
from cancer from step (e) and the DNA obtained from 
a sanple of the subject for diagnosis from step (f) 
to determine whether the patterns are the same or 
different and to diagnose thereby predisposition to 
cancer if the patterns are the same. 

91. The method of claim 90, wherein the size 

fractionation in step (c) is effected by a 
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polyacrylatnide or agarose gel. 

92. The method of claim 90, wherein the detectable 
marker is radioactive isotope ^ enzyme, dye, bio tin, 

5 a fluorescent label or a chemiluminescent label. 

93, A method of diagnosing cancer in a subject which 
comprises: 

10 a) obtaining RNA from the sample of the subject 

suffering from cancer; 

b) separating the RNA. sample by size fractionation; 

15 c) contacting the resulting RNA species with a nucleic 

acid probe capable of specifically hybridizing with 
a unique sequence included within the sequence of a 
nucleic acid molecule encoding a mutated TREX 
protein, wherein the sequence of the nucleic acid 

20 molecule encoding the mutated TREX protein is 

labeled with a detectable marker; 

d) detecting labeled bands which have hybridized to the 
RNA species to create a xinique band pattern specific 

25 to the RNA of. subjects suffering from cancer; 

e) preparing RNA obtained from a sample of a subject 
for diagnosis by steps (a-d) ; and 

30 f ) comparing the detected band pattern specific to the 

RNA obtained from a sample of subjects suffering 
from cancer from step (d) and the RNA obtained from 
a sample of the subject for diagnosis from step (f ) 
to determine whether the patterns are the same or 

35 different and to diagnose thereby predisposition to 

cancer if the patterns are the same. 
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94. The method of claim 93, wherein the size 
fractionation in step (c) is effected by a 
polyacrylamide or agarose gel. 

95. The method of claim 93, wherein the detectable 
marker is radioactive isotope, enzyme, dye, biotin, 
a fluorescent label ov a chemiluminescent label. 

96. The method of either of claim 90 or 93, wherein 
cancer associated with the eaqpression of a mutated 
TREX protein is diagnosed. 

97. The method of either of claim 90 or 93, wherein the 
ccoicer is colon cancer, gastric cancer, human head 
and neck squamous cell carcinoma, prostate 
carcinoma, breast cancer, thyroid cancer, esophageal 
cancer, lung cancer, colorectal cancer, ovaricoi 
cancer, papillary bladder cancer, osteosarcoma, 
chondrosarcoma, liposarcoma, giant cell tumor, Ewing 
sarcoma, or other malignant tumors- 
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FIG. 7A-1 



cctgatcgtt ggtagtggca tggaggacgg ggctggcatt tcagactgcc agctgttttt 
acx»gccgct gcatcacttg aatagaagct atgcatattg gctggccgac aaagccaagg 
gacaaaagct atggccgtta aaatggtccc tctgagtcca gggctctttc cctggctttt 
agcaccatgg atctcttcct tttcatccca tcagcaatgt ggtaccttct tctacttgat 
gatgacagct gatacttcag atttgcctga ctaaggttag aaacctgaat cgctgtgagg 
aagatgaaat ttccatttta cttggtgcct tgtgcaggga gcacactgat ccttccagaa 
acttgtgtgt gaaaagaggt: t:gcc^tttgt cagacagact catcfgttatg gcgagcgatc 
cgacgtgatc agagtgggca agaggcacag cgaactcatg ac^aggctata ccatgttigcg 
gaatggggga gtggggaacg gtggtxsagac ctgtatgctg cgctggtcca atcgcatccg 
gctgacat:gg ctgagtttca cgctgttcat catcctcgtc ttcttccccc tcattgctca 
ctattacctc accactctgg acgaggcaga cgaggctggc aagcgcatct tcggccctcg 
ggctggcagt gagctctgtg aggtaaagca tgtccttgat ctctgtcgga ttcgtgagtc 
tgtgagcgaa gagcttctac agctogaagc caagcggcag gagctgaaca gcgagattgc 
caagctgaac ctcaagattg aagcc^gtaa gaagagcata gagaatigcca agcaggacct 
gctgcagctc aagaatgtca ttagccagac agagcactcc tacaaggagc tgatggccca 
gaaccagccc asiactgtccc t:gccca^ccg actgctccct gagaaggacg atxrccggcct 
tccacccccc aaggtcactc ggggttgccg ccttcacaac tgctttgatt actctcgttg 
tcctctgacg tctggctttc ccgtctacgt ctatgacagt gaccagtttg cctttgggag 
ctacc^ggac cctttggtca agcaggctut tcaggctaca gtgagagcca acgtttatgt 
tacagaaaat gcggccatcg cctgcctgta tgtggtgtta gtgggagaaa tgcaagagcc 
CMtgtgctg cggcctgccg acctt gaaaa gcagctgttt tctctgccac actggaggac 
agatigggcac ciaccacgtaa titat:caacct g^cccggaag tcagacacac agaatctact 
gta c aa c gtc agtacaggcc gccatgtggc ccagtccacc ctctatgctg cccagtacag 
agctggcttt gacctggtcg tgtcacccct tgtccatgct atgtctgaac ccaacttcat 
ggaaatccca ccgcaggtgc cagttaagcg gaaatatctc ttcactttcc agggcgagaa 
gatcgagtct ctgagatcta gccttcagga ggcccgttcc ttcgaggaag agatggaggg 
cgaccc^ccg gccgactatg acgalicgcat cal:tgccacc ctiaaaggctg tacaggacag 
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FIG. 7A-2 



caagctggat caggtgctgg tagaattcac ttgcaaaaac cagccgaagc ctagcctgcc 
gactgagtgg gcactgtigtig gggagcggga agaccgcctg gagttactga agctctccac 
cttcgccctc atcatcactc ccggggraccc gcgcctgctc atttcatctg ggtgtgccac 
gcggctcttc gaggccctgg aggtgggggc cgtgccggtg gtgctcgggg agcaggtgca 
gctcccgtac cacgacatgc ^gcagtggaa cgaggccgcc ctggtgg^c ccciagcctcg 
cgtcacagag gtccacttcc tgttacgaag tctttcagac agtgatctgt tggccatgag 
gcggcaaggc cgctttctct gggagaccta cttctccacc gcagacac^ta tttttaatac 
cgtgctggcc atgattagga ctcgaattca gatcccagct gctcccatcc gggaagaggt 
agcggctgag atcccccatc gttcaggcaa agcagctgga actgacccca acatggctga 
caatggggac ctggacctgg ggcccrgrtaga gacagaacca ccctatgcct cacctaaata 
cctccgcaat ttcactctga ctgtcacaga ctgttaccgt ggctggaact ctgccccggg 
acggttccat ctttttcccc acacaccctt tgatcctgtg ttgccctctg aggccaaatt 
cttgggctca gggactggat ttcggccgat cggtggcggg gctgggggct ctggcaagga 
gttccaggca gcgctcggag gcaatgtcca gcgggagcag ttcacagttg tgatgctgac 
ctacgagcgg gaggaagtgc tcatgaactc cctggagaga ctcaacggcc tcccctacct 
gaacaaggta gtggtggtgt ggaactctcc caagctgccc tcggaggacc ttttgtggcc 
agacattggt gtccccatca tggtcgtccg tactgagaag aacagtttga acaatcggtt 
cttgccctgg aatgagattg agacagaggc catactgtcc atcgacgatg atgctcacct 
ccgccatgat gaaatcatgt ttgggttttg ggtgtggaga gaagcacgrtg atcgcattgt 
gggtttccct ggccggtacc atgcgtggga catcccgcac cagtcctggc tctacaattc 
caactactcc tgtgagctgt ccatggtgct gacgggcgct gccttctttc acaagtatta 
tgcctacctg tattcttatg tgatgcccca ggccatccgg gacatgg t gg acgagtacat 
caactigtgag gatatcgcca t:gaact:tcct tgtctcccac atcacacgga aaccccccat 
caaggtgaca tcaaggtgga cttttcgatg cccaggcftgc cctcaggccc tgtcccatga 
tgactctcat tttcacgagc ggcacaagtg tatcaacttt tttgtcaagg tgtacggcta 
tatgcctctc ttgtacacac agttcagggt ggactccgtg ctcttcciaga cccgcctgcc 
ccatg£icaag accaa g tgct tcaagttcat ctagggcctit gcagtitictga ggagacaatg 
agcagagcga gggggagtca ccctcaaggt tcccaaggtg tcgaaggtcc ttcfgggacat 
ctgtcgggca gggccaagac cctttgctgg gagaggcagc aggaagagtg gaaagggata 
gctgtctttc attttgaagt cagccacact gggcctggga tcctggtcag agactcaggn 
cgtctgcaca gggcactgac tgatagcgaa cactgaggac tgttcataag cccaggaca 
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FIG. 7B.1 



10 20 30 40 50 60 

cctgatcgttggtagtggcatggaggacggggctggcatttcagactgccagctgttttt 



70 80 90 100 110 120 

accagccgctgcatcacttgaatagaagctatgcatattggctggccgac3aagccaagg 



130 140 150 160 170 180 

gacaaaagctatggccgttaaaatggtccctctgagtccagggctctttccctggctttt 



190 200 210 220 230 240 

agcaccatggatctcttccttttcatcccatcagcaatgtigcrtaccttcttctacttgat 



250 260 270 280 290 300 

gatgacagctgatacttcagatttgcctgactaaggttagaaacctgaatcgctgtgagg 



310 320 330 340 350 360 

aagatgaaatttccattttacttggtgccttgtgcagggagcacactgatccttccagaa 



370 380 390 400 410 420 

acttgtgtgtgaaaagaggttgcgttttgtcagacagactcatggttatggcgagcgatc 



430 440 450 460 470 480 

cgacgtgatcagagtgggcaagaggcacagcgaactcatgacaggctataccatgttgcg 

MTGYTMLR 



490 500 510 520 530 540 

gaatgggggagtggggaacgg t gg t cagacctgrtatgctgcgctggtccaatcgcatccg 
NGGVGNGGQTCMLRWSNRIR 



550 560 570 580 590 600 

gctgacatggctgagtttcacgctgttcatcatcctcgtcttcttccccctcattgctca 
LTWLSFTLFIILVFFPLIAH 



610 620 630 640 650 660 

ctattacctcaccactctggacgaggcagacgaggctggcciagcgcatcttcggccctcg 
YYLTTLDEADEAGKRIFGPR 

670 680 690 700 710 720 

ggctggcagtgagctctgtgaggtaaagcatgtccttgatctctgtcggattcgtgagtc 

AGSELCEVKHVLDLCRIRES 
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FIG. 7B-2 

730 740 750 760 770 780 

tgtgagcgaagagcttctacagctcgaagccaagcggcaggagctgaacagcgagattgc 
VSEELLQLEAKRQELNSEIA 



790 800 810 820 830 840 

caagctgsiacctcaagattgaagcctgtaagciagagcatagagaatgccaagcaggacct: 
KLNLKIEACKKSIENAKQDL 

850 860 870 880 890 900 

gctgcagctcaagaatgtcattagccagacagagcactcctacciaggagctgatggccca 
LQLKNVISQTEHSYKELMAQ 

910 920 930 940 950 960 

gaaccagcccaaacHgtccctgcccatccgactgctccctgagaaggacgatgccggcct 
NQPKLSLPIRLLPEKDDAGL 



970 980 990 1000 1010 . 1020 

tccaccccccaaggtcactcggggttgccgccttcacaactgctttgattactctcgttg 
PPP KVTRGCRLHNCFDYSRC 



1030 1040 1050 1060 1070 1080 

tcctctgacgtctggctttcccgtctacgtctatgacagtgaccagtttgcctttgggag 
PLTSGFPVYVYDSDQFAFGS 



1090 1100 1110 1120 1130 1140 

ctacctggaccctttggtcaagcaggcttttcaggctacagtgagagccaacgtttatgt 
YLDPLVKQAFQATVRANVYV 



1150 1160 1170 1180 1190 1200 

tacagaaaatgcggccatcgcctgcctgtatgtggtgttagtgggagaaatgcaagagcc 
TENAAIACIiYVVLVGEMQEP 



1210 1220 1230 1240 1250 1260 

cactgtgctgcggcctgccgaccttgaaaagcagctgttttctctgccacactggaggac 
TVLRPADLEKQLFSLPHWRT 



1270 1280 1290 1300 1310 1320 

agatgggcacaaccacgtcattatcaacctgtcccggaagtcagacacacagaatctact 
DGHNHVIINLSRKSDTQNLL 

1330 1340 1350 1360 1370 1380 

gtacaacgtcagtacaggccgccatgtggcccagtccaccctctatgctgcccagtacag 
YNVSTGRHVAQSTLYAAQYR 



1390 1400 1410 1420 1430 1440 

agctggctttgacctggtcgtgtcaccccttgtccatgctatgtctgaacccaacttcat 
AGFDLVVSPLVHAMSEPNFM 



1450 1460 1470 1480 1490 1500 

ggaaatcccaccgcaggtgccagttaagcggaaatatctcttcactttccagggcgagaa 
EI PPQVPVKRKYLFTF QGEK 
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FIG. 7B-3 

1510 1520 1530 1540 1550 1560 

gatcgagtctctgagatctagccttcaggaggcccgttccttcgaggaagagatggaggg 
lESLRSSLQEARSFEEEMEG 



1570 1580 1590 1600 1610 1620 

cgaccctccggccgactatgacgatcgcatcattgccaccctaaaggctgtacaggacag 
DPPADYDDRIIATLKAVQDS 



1630 1640 1650 1660 1670 1680 

caagctggatcaggtgctggtagaattcacttgcaaaaaccagccgaagcctagcctgcc 
KLDQVLVEFT. CKNQ PKPSLP 



1690 1700 1710 1720 1730 1740 

gactgagcgggcactgtgtggggagcgggaagaccgcctggagttactgaagctctccac 
TEWALCGEREDRLELLKLST 



1750 1760 1770 1780 1790 1800 

ctrcgccctcatcatcactcccggggacccgcgcctgctcatttcatctgggtgtgccac 
FALIITPGDPRLLISSGCAT 



1810 1820 1830 1840 1850 1860 

gcggctcttcgaggccctggaggtgggggccgtgccggtggtgctcggggagcaggtgca 
RLFEALE VGAVPVVLGEQVQ 

1870 1880 1890 1900 1910 1920 

gctcccgtaccacgacatgctgcagtggaacgaggccgccctggtggtgcccaagcctcg 
LPYHDMLQWNEAALVVPKPR 



1930 1940 1950 I960 1970 1980 

cgtcacagaggtccacttcctgttacgaagtctttcageuagtgatctgttggccatgag 
VTEVHFLLRSLSDSDLLAMR 



1990 2000 2010 2020 2030 2040 

gcggcaaggccgctttctctgggagacctacttctccaccgcagacagtatttttaatac 
RQGRFLWETYFSTADS IFNT 



2050 2060 2070 2080 2090 2100 

cgtgctggccatgattaggactcgaattcagatcccagctgctcccatccgggaagaggt 
VLAMIRTRIQIPAAPIREEV 



2110 2120 2130 2140 2150 2160 

agcggccgagatcccccatcgttcaggcaaagcagctggaactgaccccaacatggctga 
AAEIPKRSGKAAGTDPNMAD 



2170 2180 2190 2200 2210 2220 

caacggggacccggacctggggccggcagagacagaaccaccctataccncacccaaata 
NGDLDLGPVZTEPPYASFKY 



2230 2240 2250 2260 2270 2280 

cccccgcaatttcaccccgactgccacagactgtcaccgcggccggaacrccgccccggg 
LRNFTLTVTDCYRGWNSAPG 
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FIG. 7B-4 

2290 2300 2310 2320 2330 2340 

acggttccatctttttccccacacaccctttgatcctgtgttgccctctgaggccaaatt 
RFHLFPHTPFDPVLPSEAKF 



2350 2360 2370 2380 2390 2400 

cttgggctcagggactggatttcggccgatcggtggcggggctgggggctctggcaagga 
LGSGTGFRPIGGGAGGSGKE 

2410 2420 2430 2440 2450 2460 

gttccaggcagcgctcggaggcaatgtccagcgggagcagttcacagttgtgatgctgac 
FQAALGGNVQREQFTVVMLT 

2470 2480 2490 2500 2510 2520 

ctacgagcgggaggaagtgctcatgeiactccctggagagactcaacggcctcccctacct 
YEREEVLMNSLERLNGLPYL 



2530 2540 2550 2560 2570 ^ 2580 

gaacaaggtagtggtggtcrtggaactctcccaagctgccctcggaggaccttttgtggcc 
NKVVVVWNSPKLPSEDLLWP 



2590 2600 2610 2620 2630 2640 

agacattggtgtccccatcatgg t cg t ccgtactgagaagaacagtttgaacaatcggtt 
DIGVPIMVVRTEKNSLNNRF 



2650 2660 2670 2680 2690 2700 

cttgccctggaatgagattgagacagaggccatactgtccatcgacgatgat^gctcacct 
LPWNEIETEAILSIDDDAHL 



2710 2720 2730 2740 2750 2760 

ccgccatgatgaaatcatgtttgggttttgggtgtggagagaagcacgtgatcgcattgrt 
RHDEIMFGFWVWREARDRIV 

2770 2780 2790 2800 2810 2820 

gggtttccctggccggtaccatgcgtgggacatcccgcaccagtcctggctctacaattc 
6FPGRYHAWDIPHQSWLYNS 



2830 2840 2850 2860 2870 2880 

caactactcctgtgagctgtccatggtgctgacgggcgctgccttctttcacaagtatta 
NYSCELSMVLTGAAFFHKYY 

2890 2900 2910 2920 2930 2940 

tgcctacctgtattcttatgtgatgccccaggccatccgggacatggtggacgagtacat 
AYLYSYVMPQAIRDMVDEYI 

2950 2960 2970 2980 2990 3000 

caactgtgaggatatcgccatgaacttccttgtctcccacatcacacggaaaccccccat 
NCEDIAMNFLVSHITRKPPI 

3010 3020 3030 3040 3050 3060 

caaggtgacatcaaggtggacttttcgatgcccagggtgccctcaggccctgtcccatga 
KVTSRWTFRCPGCPQAIiSHD 
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FIG. 7B-6 

3O70 3080 3090 3100 3110 3120 

tgactct^ttttcacgagcggcacaagtgtotcaacttttttgtcaaggtgtaOT 
DSHFHERHKCINF.FVKVYGY 

3130 3140 3150 3160 3170 3180 

t^tgcctctcttgtacacacagttcagggtOTactccgtgctcttcaagaccc^^^ 
MPLLYTQFRVDSVLFKTRi^*' 

3190 3200 3210 3220 3230 3240 

ccatgacaagaccaagtgcttcaagttcatctagggccttgcagttctgaggagacaatg 

HDKTKCFKFI * 

3250 3260 3270 3280 3290 3300 

agcagagcgagggggagtcaccctcaaggttcccaaggtgtcgaaggtccttggggacat 

3310 3320 3330 3340 3350 3360 

ctgtcgggcagggccaagaccctttgctgggagaggcagcaggaagagtggaaagggata 

3370 3380 3390 3400 3410 3420 

gctgtctttcatttt^aagtcagccacactgggcctgggatcctggtcagagactcaggn 

3430 3440 3450 3460 3470 
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FIG. 8Ar1 



ggcgggtccc tgagctggaa gccggagagc aagccctgga ggttcactct ttcaagaagt 
cgtgtgctga ggtgtaatgc tacacaagtc agaggaagga agggtcctga aacacatggc 
ctgattgttg gcaaaggcat cataagaagc tggcatttat ttctgrttcta acctattact 
gtataactgt gaat:agacac tatgcatatt tgttggtcag caaaaccaag aaacaagagc 
tatggcattt gaciaaagtct gtctgattcc agggtgtttt tcctgggttt catcatcagg 
tacctxrctcc ctttcatctc agcaagaatg tggcaccttt tatcgtttga taaagattaa 
ggacatgttc tttggtceiac agccagaact taaaatctgc tggaat:aggg tcagagacca 
tttcagctgc agctgaggaa aatgaaatgt tcattttatt tggtgccttg tctggggagc 
acactaactc ttctggraaac gtgtcagtga aacagagatc gttttgtgga atagcaaccc 
atggttatgg cgagtgaccc gacgtgatct gygyggc agg ctgcagagga ctcatgacag 
gctataccat gctgcggatat gggggcgcgg ggaacggagg tcagacctgc atgctgcgct 
ggtccaaccg catccgcctc acgtggctca gcttcacgct ctttgtcatc ctggtcttct 
tcccgctcat cgcccactat tacctcacca ctctggatga ggctgatgag gcaggcaagc 
ggatttttgg tccccgggtg gggaacgagc tgtgcgaggt gaagcacgtg ctggatctgt 
gccgcatccg ggagtcggtg agtgaagagc tcctgcagct ggaggccaag cgccaagagc 
tgaacagcga gatcgccaag ctgaatctga agatcgaagc ctgtaagaag agcattgaga 
acgccaagca ggacctgctc cagctcaaga atgtcatcag ccagaccgag cattcctaca 
aggagctcat ggcccagauic cagcccaagc tgtccctgcc catccgactg ctcccagaga 
aggacgatgc cggcctccct cccccgaagg ccactcgggg ctgccggcta cacciactgct 
ttgattattc tcgttgccct ctcacctctg gcttcccggt ctacgtctat gacagtgacc 
agtttgtctt tggcagctac ctggatccct tggtcaagca ggcttttcag gcgacagcac 
gagctaacgt ttatgttaca gaaaatgcag acatcgcctg cctttacgtg atactagtgg 
gagagatgca ggagcccgtg gtgctgcggc ctgctgagct ggagaagcag ttgtattccc 
tgccacactg gcggacggat ggacacaacc atgtcaticat caatctgtca cgtaagtccig 
atacacagaa ccttctctat aacgtcagta ctggccgtgc catggtggcc cagtccacct 
tctacactgt ccagtacaga cctggctttg acttggtcgt atcaccgctg gtccatgcca 
tgtctgagcc caacttcatg gaeiatcccac cacaggtgcc ggtgaagcgg aaatatctct 
tcaccttcca gggcgagaiag attgagtctc tgaggtctag ccttcaggag gcccgctcct 
tcgaagagga eiatggagggc gaccctcccg ccgactacga tgaccggatc attgccaccc 
tgaaggcggt gcaggacagc aagctggatc aggtcctggt ggaattcacc tgca aaa acc 
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FIG. 8A-2 



agcccaaacc cagcctgccg actgagtggg 
fiattgctgaa gctctccacc ttcgccctca 
tttcctctgg gtgtgcaaca cggctcttcg 
tgctggggga gcaggtccag cttccctacc 
tggtggtgcc euiagcctcgrt gttaccgagg 
gtgacctcct ggctatgagg cggcaaggcc 
ctgacagtat ttttaatacc gtgctggcta 
ctcccatccg ggaagaggcg gcagctgaga 
ctgaccccaa catggctgac aacggggacc 
cctacgcctc acccagatac ctccgcaatt 
gctggaactg tgctccaggg cctttccatc 
tgccctcaga ggccaaattc ttgggctcag 
ctgggggttc tggcaaggaa tttcaggcag 
tcacggtggt gatgttgact tatgagcggg 
tgaatggcct cccttacctg aacaaggtcg 
cagaggacct tctgtggcct gacattggcg 
acagtttgaa caaccgattc ttaccctgga 
ttgatgacga tgctcacctc cgccatgacg 
aagctcggga ccgcatcgtg ggcttccctg 
agtcctggct ctacaactcc aiactactcct 
ccttctttca caagtattat gcctacctgt 
acatggtgga tgaatacatc aactgtgagg 
tcactcggaa gccccccatc ciaggtgacct 
ctcaggccct gtctcatgat gactcccact 
tcgtgaaggt gtacggctac atgcccctcc 
tcttcciagac acgcctgccc catgacaaga 
gcacggtctg gggaagagga ^gagcagagg 
ttgcaggacc ttgggcacat ctgctggtgg 
caggaggagt ggaaggaaac cgctgccttt 
ccctgggcgg agtccccggg gttccccaca 
actgtggcga ctctgcagag tcactcacac 
gtttttacat tcaataacaa ctattatgat 
attcaaggct tatttatata tatgtgtgtg 



cactgtgtgg agagcgggag gaccgcttgg 
tcattacccc cg g g g accct cgcttggtta 
aagccctgga agtcggtgcc gtcccggtgg 
aggacatgct gcagtggraac gaggcggccc 
ttcatttcct gctcagaagc ctctccgata 
gctttctctg ggagacttac ttctccactg 
tgattaggac tcgcatccag atcccagccg 
tcccccaccg ttcaggc^iag gcggctggaa 
tggacctggg gccagtggag acggagccgc 
tcactctgac tgtcactgac ttttaccgca 
ttttccccca cactcccttt gatectgtgt 
ggactggctt tcggcctatt ggtggtggag 
cgcttggagg caatgttccc cgagagcagt 
aggaagtgct tatgaactct ttagagaggc 
tggtggtgtg gaattctccc aagctgccat 
ttcccatcat ggtggtccgt actgagaaga 
atgaaattga gacagaggcc atcctgtcca 
aaatcatgtt tgggttccgg gtgtggagag 
gccgttacca cgcatgggac atcccccatc 
gtgagctgtc catggtgctg acaggtgctg 
attcttatgt gatgccccag gccatccggg 
£K:attgccat gaacttcctt gtctcccaca 
cacggtggac attccgatgc ccaggatgcc 
tccacgagcg gcacaagtgc atcaacttct 
tgtacacgca gttcagggtg gattctgtgc 
ccaagtgctt caagttcatc taggggcagc 
gaggciagatg gctcccetagg ttcctaggca 
gtggcccaga gcctctgctg gaaggggcag 
atcttgaagt cagccacact gggcctggag 
cagggcactg actgatagct tacactgagg 
cgttcgtacg cccaggacag ctggttcgtg 
tatttaaaaa gagaaagttt cagatttgcc 
tatataaata catgcacaca cttgcat:aca 
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tatatatttt tggctggggg agtgtgagtt ttgcctttct aagggaggga ccgcgcaggc 
tcctttgttc tgtattctgg cggagatggg tcctggcctt gtgtcactgg cttatcctta 
aagatcatct cccatccHcc ccagcgccat ctgtgtgcag caaccagsuia gggatgeiact 
tggccctctt gcgggcctgg acaaggtctc ttccttaccc tttctgttgc cagtcagcaa 
cctgt5iactc acattctctt cccagtgaat ccctgggagc gcctgaccct gg t g g gctgt 
tcagcttcct gctgctgggg ccagcgattt ttgaggattt atctttaggc caggcttgcc 
tccgtactta tccctgctct cccatittctc tcttgtttga gagagaatga ggaagcaaag 
agtgagaaag aataggggct gaagacgcca ctcccagatg gctctttcta tcctgctctt 
ctgttgaaac acacgtgctg tgggcctcag gcgtttctga agtgctcttt cttggattgg 
acaggagatc agcagcgtgc acatctgctg tggtctgaag tggtttgcag gtcagcctcc 
tctccctagt gtagagcaag ccagtgtcct tcgaggaacc cacccggctg gccgggaagt 
tttacagcaa ggcgcctgcc ttgggataat tccttggtga aattcacctt ccccccgcct 
ctgtctggag ccccatcctg tgttatctgt ggtttttgga cccctaatgt cagcttggct 
gtaggactcc ccgaggtttg gtatgtgcta gaacaatggg aggctgtgat ttgctgtgta 
agctcacatc cagccttgga atctaaoggg cattcacaac cogacrttacc actttccact 
ccctgcttag gattctgttc cctgggctga aactgaaata agctaatttt ttgggtcacg 
gtggcagtag gggaacctag gagggtgtga gtggcatttg tcagggattt agcccatgac 
gtgtttcttg aaccctactt tctggaagtg gagttgactc tggaagtttt ctagcaactg 
aacaaaagct caggtttgtc ctggtcatgc acatgcctta agccagttcc gtcttcccta 
gaccttggca tcctgtgctt ctatttcttg gaatacgttc tcctctgacc tgcctgtacc 
acgtgggtcc tcttcaagta ctgttttgaa gctgggctct tttgtgtagc tcccacccac 
ctgtagggct agctcggctt aagggaactc tccccattgg caaaccggac ccggccgccg 
ccaggactgt gtttccaaag gttccccgcc cccaacccca gcatcagcct gtagctcccc 
tgctgaggca gtgtggttiat gt:t:cccagca gt g g ggyt ca gacgcccttc ctcagaactt 
tctagttgcc ctctacctga ctcctgactt gtattccttt tagcagtagc cttcttccct 
cggggagcca aagagtgtgg tgtgtggcgc tatattgtgg ctgctatttc atctggtttc 
ttttaatgtg aggaactcac atactgactt cagtgggact cggtgagccg ggg c cgtctg 
tgtggtggga ccccctttag cgggactcsig tgagctgggg ccgtctgtgt ggtggagcca 
gggcctctcc ctttagtgga gccaggttgt cgggccccga atgtcactgg tggatctaag 
aagggctgag tggtctgaca ccaaaacatg ccgcagggag ggctgtggtg ccggtgcttc 
caacaaggac agccctcctt gaccctgaaa ggaacactgg cttgaiaggac tgcagacagg 
ctctgagggg cacgccctcc tcagcgagag gcagcaaggt ggccacagtg tcactggtca 
ggtgcttctc accacgggaa agccgccgac ctgtgactcg cttgagatgg gaaagcggcg 
ccacagaccc cgggtctcct tggctgtctg tgggccgccc ctggccacct tgtcctggct 
cgcagggtgc aggagcgcct cgttctctgg gtggccggct tgctgctccg gtttgggctg 
tcttaccata acacccftccc cigggctctgc aggccactgt gagcgctggc tccctgggca 
gtgctccticc gtgtggactg tgccteaggc cagggctcac cagct yg g g t cctgtccgga 
aggatgggat ctttctggga gctgcgccgg acagagtggg gagctcctag tttgtggggg 
gaagctttga tatccatgcc acgtccatcc accccacccc ttttcgtcac gagcacaatg 
gtcttacatt ggatttttgt aaaaaaataa aaataaatgg agactttciac tc 
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FIG. 8B-1 

10 20 30 40 50 60 

ggcgggtccctgagctggaagccggagagcaagccctggaggttcactctttccuigaagt 



70 80 . 90 100 110 120 

cgtgtgctgaggtgtaatgctacacaagtcagaggaaggaagggtcctgaaacacatggc 



130 140 150 160 170 180 

ctgattgttgg ca aa g gcatcataagaagctggcatttatttctgttcteiacctattact 



190 200 210 220 230 240 

gtataactgtgaatagacactatgcatatttgttggtcagcaaaaccaagaaacaagagc 



250 260 270 280 290 300 

tatggcatttgaaaaagtctgtctgattccagggtgtttttcctgggtttcatcatcagg 



310 320 330 340 350 360 

tacctcctccctttcatctcagcaagaatgtggcaccttttatcgtttgataaagattaa 



370 380 390 400 410 420 

ggacatgttc^ttggtcaacagccagaacttaaaatctgctggaatagggtcagagacca 



430 440 450 460 470 480 

tttcagctgcagctgaggaaaatgaaatgttcattttatttggtgccttgtctggggagc 



490 500 510 520 530 540 

acactaactcttctggaaacgtgtcagtgaaacagagatcgttttgtggaatagcaaccc 



550 560 570 580 590 600 

atggttatggcgagtgacccgacgtgatctggggggcaggctgcagaggactcatgacag 

M T G 



610 620 630 640 650 660 

gctataccatgctgcggaatgggggcgcggggaacggaggtcagacctgcatgctgcgct 
YTMLRNGGAGNGGQTCMLRW 

670 680 690 700 710 720 

ggtccaaccgcatccgcctcacgtggctcagcttcacgctctttgtcatcctggtcttct 
SNRIRLTWLSFTLFVILiVFF 
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730 740 750 760 770 780 

tcccgctcatcgcccactattacctcaccactctggatgaggctgacgaggcaggcaagc 
PLIAHYYLTTLDEADEAGKR 



790 800 810 820 830 840 

ggatttttggtccccgggtggggaacgagctgtgcgaggtgaagcacgtgctggatctgt 
IFGPRVGNELCEVKHVLDLC 



850 860 870 880 890 900 

gccgcatccgggagtcggtgagtgaagagctcctgcagctggaggcc3agcgccaagagc 
RIRESVSEELiLQLEAKR Q E L 

910 920 930 940 950 960 

tgaacagcgagatcgccaagctgaatctgaagatcgaagcctgtaagaagagcattgaga 
NSEIAKLNLKIEACKKSIEN 



970 980 990 1000 1010 1020 

acgccaagcaggacctgctccagctcaagaatgtoatcagccagaccgagcattcctaca 
AKQDLLQLKNV ISQTEHSYK 

1030 1040 1050 1060 1070 1080 

aggagctcatggcccagaaccagcccaagctgtccctgcccatccgactgctcccagaga 
ELMAQNQPKLSLPIRLLPEK 



1090 UOO UIO 1120 1130 1140 

aggacgatgccggcctccctcccccgaaggccactcggggctgccggctacaceiactgct 
DDAGLPPPKATRGCRLHNCF 



1150 1160 1170 1180 1190 1200 

ttgattattctcgtitgccctcticacctctggcttcccggtctacgtctatgacagtgacc 
DYSRCPLTSGFPVYVYDSDQ 



1210 1220 1230 1240 1250 1260 

agtttgtctttggcagctacctggatcccttggtcaagcaggcttttcaggcgacagcac 
FVFGSYLDPLVKQAFQATAR 



1270 1280 1290 1300 1310 1320 

gagctaacgtttatgttacagaaaatgcagacatcgcctgcctttacgtgatactagtgg 
ANVYVTENADIACLYVILVG 



1330 1340 1350 1360 1370 1380 

gagagatgcaggagcccgtggtgctgcggcctgctgagctggagaagcagttgtattccc 
EMQEPVVLRPAELEKQLYSL 



1390 1400 1410 1420 1430 1440 

tgccacactggcggacggatggacacaaccatgtcatcatcaatctgtcacgtaagtcag 
PHWRTDGHNHVIINLSRKSD 



1450 1460 1470 1480 1490 1500 

atacacagaaccttctctataacgticagtactggccgtgccatggtggcccagtccacct 
TQNLLYNVSTGRAMVAQSTF 
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FIG. 8B-3 

1510 1520 1530 1540 1550 1560 

tctacactigtccagtacagacctggctttgacttggtcgtatcaccgctggtccatgcca 
YTVQYRPGF D L VVSPLVHAM 



1570 1580 1590 1600 1610 1620 

tgtctgagcccaacttcatggaaatcccaccacaggtgccggtgaiagcggaaatatctct 
SEPNFMEIPPQVPVKRKYLF 



1630 1640 1650 1660 1670 1680 

tcaccttccagggcgagaagattgagtccctgagcrtctagccttcaggaggcccgctcct 
TFQ6EKIESL. RSSLQEAR SF 

1690 1700 1710 1720 1730 1740 

tcgaagaggaaatggagggcgaccctcccgccgactacgatgaccggatcattgccaccc 
EEEMEGDPPADYDD'RI lATL 



1750 1760 1770 1780 1790 1800 

tgaaggcggtgcaggacagcaagctggatcaggtcctggtggaattcacctgcaaaaacc 
KAVQDSKLDQVLVEFTCKNQ 



1810 1820 1830 1840 1850 1860 

agcccaaacccagcctgccgactgagtgggcactgtgtggagagcgggaggaccgcttgg 
PKPSLPTEWALCGEREDRLE 



1870 1880 1890 1900 1910 1920 

aattgctgaagctctccaccttcgccctcatcattacccccggggaccctcgcttggtta 
LLKLSTFALIITPGDPRIiVI 



1930 1940 1950 i960 1970 1980 

tttcctctgggtgtgcaacacggctcttcgaagccctggaagtcggtgccgtcccggtgg 
SSGCATRLF EALEVGAVPVV 



1990 2000 2010 2020 2030 2040 

tgctgggggagcaggtccagcttccctaccaggacatgctgcagtggaacgaggcggccc 
LGEQVQLPYQDMLQWNEAAL 

2050 2060 2070 2080 2090 2100 

tggtggtgccaaagcctcgtgttaccgaggttcatttcctgctcagaagcctctccgata 
VVPKPRVTEVHFLLRSLS DS 

2110 2120 2130 2140 2150 2160 

gtgacctcctggctatgaggcggcaaggccgctttctctgggagacttacttctccactg 
DLLAMRRQGRFLWETYFSTA 



2170 2180 2190 2200 2210 2220 

ctgacagtatttttaataccgtgctggctatgattaggactcgcatccagatcccagccg 
DSIFNTVLAMIRTRIQIPAA 

2230 2240 2250 2260 2270 2280 

ctcccatccgggaagaggcggcagctgagatcccccaccgttcaggcaeiggcggctggaa 
PIREEAAAEIPHRSGKAAGT 
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2290 2300 2310 2320 2330 2340 

ctgaccccaacatggctgacaacggggacctggacctggggccagtggagacggagccgc 
DP NMADNGDLDLGPVETEPP 

2350 2360 2370 2380 2390 2400 

cctacgcctcacccagatacctccgcaatttcactctgactgtcactgacttttaccgca 
YASPRYLRNFTLTV T D F Y R S 



2410 2420 2430 2440 2450 2460 

gctggeuictgtgctccagggcctttccatcttttcccccacactccctttgaccctgtgt 
WNCAPGPFHL.FPHTPFDP VL 

2470 2480 2490 2500 2510 2520 

tgccctcagaggccaaattcttgggctcagggactggctttcggcctattggtggtggag 
PSEAKFLGSGTGFRPIGGGA 



2530 2540 2550 2560 2570 2580 

ctgggggttctggcaaggaatttcaggcagcgcttggaggceiatgttcccc^gagcagt 
GGSGKEFQAALGGNVPREQF 



2590 2600 2610 2620 2630 2640 

tcacggtggtgatgttgacttatgagc gtfga g ga agtgcttatgaactctttagagaggc 
TVVMXiTYEREEVLHNS LERL 



2650 2660 2670 2680 2690 2700 

tgaatggcctcccttacctgaacaaggtcgtggtggtgtggauittctcccaagctgccat 
NGLPYLNKVVVVWNSPKLPS 



2710 2720 2730 2740 2750 2760 

cagaggaccttctgtggcctgacattggcgttcccatcatggtggtccgtactgagaaga 
EDLLWPDIGVPIMVVRTEKN 



2770 2780 2790 2800 2810 2820 

acagtttgaacaaccgattcttaccctggaatgaaattgagacagaggccatcctgtcca 
SLNNRFLPWNEI ETEAILSI 



2830 2840 2850 2860 2870 2880 

ttgatgacgatgctcaccticcgccatgacgaaatcatgtttgggttccgggtgtggagag 
DDDAHLRHDEIMFGFRVWRE 



2890 2900 2910 2920 2930 2940 

aagctcgggaccgcatcgtgggcttccctggccgttaccacgcatgggacatcccccatc 
ARDRIVGFPGRYHAWDIPHQ 

2950 2960 2970 2980 2990 3000 

agtcctggctctacaactccaactactcctgtgagctgtccatggtgctgacaggtgctg 
SWLYNSNYSCEL SMVLTGAA 

3010 3020 3030 3040 3050 3060 

ccttctttcacaagtattatgcctacctgtattcttatgtgatgccccaggccatccggg 
FFHKYYAYLYSYVMPQAIRD 
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3070 3080 3090 3100 3110 3120 

acatggtggatgaatacatcaactgtgaggacattgccatgaacttccttgtctcccaca 
MVDEYINCEDIAMNFLVSHI 

3130 3140 3150 3160 3170 3180 

tcactcggaagccccccatcaaggtgacctcacggtggacattccgatgcccaggatgcc 
TRKPPIKVTSRWTFRCPGCP 



3190 3200 3210 3220 3230 3240 

ctcaggccctgtctcatgatgactcccacttccacgagcggcacaagtgcatcaacttct 
QALSHDDSHPHERHKCI N F F 



3250 3260 3270 3280 3290 3300 

tcgtgaaggtgtaoggctacatgcccctcctgtacacgcagttcag g g t g g attctgtgc 
VKVYGYMPLLYTQFRVDSVL 



3310 3320 3330 3340 3350 ^ 3360 

tctitcaagacacgcctgccccatgacaagaccaagtgcttcaagtitcatctaggggcagc 
FKTRLPHDKTKCFKFI * 



3370 3380 3390 3400 3410 3420 

gcacggtctggggaagaggatgag c a g agggaggaagatggctcccaaggttcctaggca 



3430 3440 3450 3460 3470 3480 

ttgcaggaccttgggcacatctgctgg Lyy y Lggcccagagcctctgc tggaaggggcag 



3490 3500 3510 3520 3530 3540 

caggaggagtggciaggaaaccgctgcctttatcttgaagtcagccacactgggcctggag 



3550 3560 3570 3580 3590 3600 

ccctgggcggag tccccgggg ttccccacacagggcactgac tgatagcttacactgagg 



3610 3620 3630 3640 3650 3660 

actgtggcgactctgcagagtcactcacaccgttcgtacgcccaggacagctggttcgtg 



3670 3680 3690 3700 3710 3720 

gtttttacattcaataacaactattatgattatttaaaaagagaaagtttcagatttgcc 



3730 3740 3750 3760 3770 3780 

attcaaggcttatttatatatatgtgtgtgtatataaatacatgcacacacttgcataca 

3790 3800 3810 3820 3830 3840 

tatatatttttggctgggggagtgtgagttttgcctttctaagggagggaccgcgcaggc 
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3850 3860 3870 3880 3890 3900 

tcctttgttctgtattctggcggagatgggtcctggccttgtgtcactggcttatcctta 



3910 3920 3930 3940 3950 3960 

aagatcatctcccatcctccccagcgccatctgtgtgcagcaaccagaaagggatgaact 



3970 3980 3990 4000 4010 4020 

tggccctcttg c gyg c ctgga c aaggtctgttccttaccctttctgttgccagtcagcaa 



4030 4040 4050 4060 4070 4080 

cctg taac tceusattctcttcccagtgaatccctgggagcgcctgaccctyy Lyyyc tgt 



4090 4100 4110 4120 4130" 4140 

tcagcttcctgctgct y g gg ccagcgattttitgaggatttatctttaggccaggcttgcc 



4150 4160 4170 4180 4190 4200 

tccgtacttatccctgctctcccatttctctcttgtttgagagagaatgaggaagcaaag 



4210 4220 4230 4240 4250 4260 

agtgagaaagaataggggctgaagacgccactcccagatggctctttctatcctgctctt 



4270 4280 4290 4300 4310 4320 

ctgttgaaacacacgtgctgtgggcctcaggcgtttctgaagtgctctttcttggattgg 



4330 4340 4350 . 4360 4370 4380 

acaggagatcagcagcgtgcacafectgctgtggtctgaagtggtttgcaggtcagcctcc 



4390 4400 4410 4420 4430 ' 4440 

tctccctagtgtagagcaagccagtgtccttcgaggaacccacccggctggccgggaagt 



4450 4460 4470 4480 4490 4500 

tttacagcaaggcgcctgccttgggataattccttggtgaaattcaccttccccccgcct 



4510 4520 4530 4540 4550 4560 

ctgtctggagccccatcctgtgttatctgtggtttttggacccctaatgtcagcttggct 



4570 4580 4590 4600 4610 4620 

gtaggactccccgaggtttggtatgtgctagaacaatgggaggctgtgatttgctgtgta 
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4630 4640 4650 4660 4670 4680 

agctcacatccagccttggaatctaacgggcattcacaacccgagttaccactttccact 



4690 4700 4710 4720 4730 4740 

ccctgcttaggattctgttccctgggctgaaactgaaataagctaattttttgggtcacg 



4750 4760 4770 4780 4790 4800 

gtggcagtaggggaacctaggagggtgtgagtggcatttgtcagggatttagcccatgac 



4810 4820 4830 4840 4850 4860 

gtgtttcttgaaccctactttctggaagtggagttgactctggaagttttctagcaactg 



4870 4880 4890 4900 4910 4920 

ciacaaaagctcaggtttgtcctggtcatgcacatgccttaagccagttccgtcttcccta 



4930 4940 4950 4960 4970 4980 

gaccttggcatcctgtgcttctatttcttggaataqgttctcctctgacctgcctgtacc 



4990 5000 5010 5020 5030 5040 

acgtgggtcctcttceiagtactgttttgaagctgggctcttttgtgtagctcccacccac 



5050 5060 5070 5080 5090 5100 

ctgtagggctagctoggcttaagggaactctccccattggcaaaccggacccggccgccg 



5110 5120 5130 5140 5150 5160 

ccaggactgtgtttccaaaggttccccgcccccaaccccagcatccigcctgtagctcccc 



5170 5180 5190 5200 5210 5220 

tgctgaggcagtgtggrttatgttcccagcagtgggggtcagacgcccttcctcagaactt 



5230 5240 5250 5260 5270 5280 

tctagttgccctctacctgactcctgacttgtattccttttagcagtagccttcttccct 



5290 5300 5310 5320 5330 5340 

cggggagccaaagagtgtggtgtgtggcgctatattgtggctgctatttcatctggtttc 



5350 5360 5370 5380 5390 5400 
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FIG. 8M 

5410 5420 5430 5440 5450 5460 

tgtggtgggaccccctttagcgggactcagtgagctggggcc^ctgtgtggtggagcca 



5470 5480 5490 5500 5510 5520 

gggcctctccctttagtggagccaggttgtcgggccccgaatgtcactggtggatctaag 



5530 5540 5550 5560 5570 5580 

aagggctgagtggtctgacaccaaaacatgccgcagggagggctgtggtgccggtgcttc 



5590 5600 5610 5620 5630 5640 

caacaaggacagccctccttgaccctgriaaggaacactggcttgciaggactgcagacagg 



5650 5660 5670 5680 5690^ 5700 

ctctgaggggcaogccctcctcagcgagaggcagcaaggtggccacagtgtcactggtca 

5710 5720 5730 5740 5750 5760 

ggtgcttc tcaccacggga aagccsgccgacctgtgac tcgc ttgagatggga h agcggcg 

5770 5780 5790 5800 5810 5820 

ccacagaccccgggtctccttggctgtctgtgggccgcccctggccaiccttgtcctggct 



5830 5840 5850 5860 5870 5880 

cgcagggtgcaggagcgcctcgttctctgggtggccggcttgctgctccggtttgggctg 



5890 5900 5910 5920 5930 5940 

tcttaccataacaccgtcccagggctctgcaggccactgtgagcgctggctccctgggca 



5950 5960 5970 5980 5990 6000 

gtgctcctccgtgtggactgtgcctcaggccagggctccu:cagctggggt:cctgtccgga 



6010 6020 6030 6040 6050 6060 

aggaCgggatc^^tetgggagctgcgccggacagagtiggggagctcctagtttyLgyyyg 



6070 6080 6090 6100 6110 6120 

gaagctttgatatccatgccacgtccatccaccccaccccttttcgtcacgagcacaatg 



6130 6140 6150 6160 6170 

gtcttacattggatttttgfaRRaaaaraaaaatanatggagactttaactc 
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FIG. 9A 



Murine TRBX 



1 MTGYTMIJUIGGWpiKSGQTCMIAWSimiIU;.TWLSrrLFJ^ 



Murine SBBZ 
Bunn TRBZ 

MuriAe TRBX 
BV 



61 pKRIFGPH 
61 iSKRIFGPII 



3BLC£VKBVLDLCRIR£5VS£ELLQLEAKRQELHSm lOCLNLR^ACKKS 
a BLCEVKHVLDLCRIRESVSEELLQLEAKRQELMSm MCLKUhlBACKKS 



121 {I£MAXQD|LLQIJCNVppQTEHSYK^^ 

121 feliAKQDgJ.QIJCMVgSCyrEHSYKELMAQNQPl^ 
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Murine TRBZ 

Bl 
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241 LVGEHQBI 
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IL F SLFHIfRXDGHliBVZINI.SRKSDTQllIJ;.YKVSTGB!H^^^ 
:QL nSLPHWRTTCHHHVIINLSlUCSiyroBrXYKVSTC] 



Murinffi TRBZ 



300 SlSLKbvAbYBMSFDLWSPLVHAK^ 

301 jSTpkW pYRPPFPLVVSPLVHAMSEPWFMEIPPQVPVK^^ 



Murine TRBZ 
B1 



360 RSFEEEHEGDPPADYODXUIIATtJCAVQDSKLDQVLVBFTCiaiQPKPSIiP^^ 

361 RSFEEEMEGDPPADYDDRIIATIJCAVQPSKLDQVLVEFTCKHQPKPSLPTBWALCGERED 



Murine TRBZ 
Buaun TRBZ 

Murine TRBZ 



420 RT.mT«KT»STFALirrPGDPRIj[;.|ISSGCATRLFBi^^ DMLQWHE 

421 SRr,ETJ«Ta»STFAI^ITPGDPRIjyirSSGCATRLFEAIJBVGAVPVVIiGE^ 



460 yU^VVPKPRVTEVHFXJ:JlSLSDSDLUmRRQGRFI.HE' 
481 AALVVPKPRVTE VHFL IJtSI^DSDIJjmRRQGRFL W J^rX f T a ri 



ITYFjSpPADSIFHTVLAHIRTRIQI 
^ADSIFNTVLAMIRTRIQI 



Murine TRBZ 



Murine TRBZ 



540 IPAAPIj 

541 PAAPIRE] 



REEVUUkE 
RESyMiB; 



IPHRSGKAAGTDPNHADKGDLDLGPVETEPPYAS 
IPHRSGKAAGTDPNMADNGDLPLGPVETEPPYAS 



.TVTDC 
.TVTDF 



^miSBa»6RFHLFPHTPF0PVLPSEAKFLGSGTGFRPIGGGAG6SGXEFQAAI/36H\^ 
irals ifHckpG P rHLFPHTPFPPVLPSEAKFLGSGTGFRPIGGGAGGSGKEFQAAIX^GIiVg lRl 



660 £QFTVVHI.TmtEEVIJINSLERZJVGLPYLllXVVVVim 

661 EQFTVVMLTYBREEVIJtMSLERIJiGLPYLWKVVVVWHSPm 



3 



Murine TRBZ 
auBfl 



720 



£KNSUIMRFLPHMEIETEAII^IDDDAHIJUIDEIHFGF]H^^ 



TRBZ 721 EKHSLHNRFLPWWEIETEAILSIDDDAHLRHDEIMFGFWVWREARPRIVGFPGRYHAWDIi 



iRXVGFPGRYHAHDIj 



Murine TRBZ 
Bl 



780 iPHQSIiLYllSMYSCELSHVLaXSAAFFHRYYAYLYSYVMPQAIRDHVDEYINCEDZAMNFLVi 

781 PHQSWLYMSNYSCELSHVLTOAAFFHKYYAYLYSYVMPQAIRDMVDBYIHCEDIAM 



Murine TRBZ 

BUBUn 



840 SHITRKPPIKVTSRWTFRCPGCPQALSHDDSHFHERHKCIHFFVKVYGYHPIJ.riX)FRVtt 

841 shitrkppikvtsrwtfrcpgcpqalshddshfherhkcinffvkvygympllytqfrvd: 



Murine TRBZ 
Bl 



900 SVLFKTRItPHDKTXCFKFI 

901 SVLFKTRLPHDKTKCFKFI 



suBsrmrre sheet (rule 26) 
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SEQUENCE LISTING 

5 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Sato, Takaaki 

10 (ii) TITLE OF INVENTION: TREX, A NOVEL GENE OF TRAF- INTERACTING 

EXT GENE FAMILY AND DIAGNOSTIC AND THERAPEUTIC USES 
THEREOF 

(ill) NUMBER OF SEQUENCES: 37 ' 

15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooper & Dunham LLP 

(B) STREET: 1185 Avenue of the Americas 

(C) CITY: New York 
20 (D) STATE: New York 

(E) COUNTRY: U.S. A 

(F) ZIP: 10036 

(V) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC con^atible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: PatentIn Release #1-0, Version #1.30 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

35 (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: White, John P. 

(B) REGISTRATION NUMBER: 28,678 

(C) REFERENCE/DOCKET NUMBER: 0575/51902-A-PCT 

40 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 278-0400 

(B) TELEFAX: (212) 391-0525 



45(2) INFORMATION FOR SEQ ID NO;l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTIt: 3479 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



55 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 458.. 3211 



60 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
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CCTGATCGTT GGTAGTGGCA TGGA.GGACGG G6CTGGCA.TT TCAGACTGCC AGCTGTTTTT 60 

ACCAGCCGCT GCATCACTTG AATAGAAlGCT ATGCATATTG GCTGGCCGAC AAAGCCAAGG 120 

5GACAAAAGCT MGGCCGTTA AAATGGTCCC TCTGAGTCCA GGGCTCTTTC CCTGGCTTTT 180 

AGCACCATGG ATCTCTTCCT TTTCATCCCA TCAGCAATGT GGTACCTTCT TCTACTTGAT 240 

GATGACAGCT GATACTTCAG ATTTGCCTGA CTAAGGTTAG T^CCTGAAT CGCTGTGAGG 300 

10 

AAGATGAAAT TTCCATTTTA CTTGGTGCCT TGTGCAGGGA GCACACTGAT CCTTCCAGAA 360 

ACTTGTGTGT GA7\AAGAGGT TGCGTTTTGT CAGACAGACT CATGGTTATG GCGAGCGATC 420 

15CGACGTG3^TC AGAGTGGGCA AGAGGCACAG. CGAACTC ATG ACA GGC TAT ACC ATG 475 

Met Thr Gly Tyr Thr Met 
1 5 

TTG CGG AAT GGG GGA GTG GGG AAC GGT GGT CAG ACC TGT ATG CTG CGC 523 
20ljeu Arg Asn Gly Gly Val Gly Asn Gly Gly Gin Thr Cys Met Leu Arg 
10 15 20 

TGG TCC AAT CGC ATC CGG CTG ACA TGG CTG AGT TTC ACQ CTG TTC ATC 571 
Trp Ser Asn Arg He Arg Leu Thr Trp Leu Ser Phe Thr Leu Phe He 
25 25 30 35 

ATC CTC GTC TTC TTC CCC CTC ATT GCT CAC TAT TAC CTC ACC ACT CTG 619 
He Leu Val Phe Phe Pro Leu He Ala His Tyr Tyr Leu Thr Thr Leu 
40 45 50 

30 

GAC GAG GCA GAC GAG GCT GGC AAG CGC ATC TTC GGC CCT CGG GCT GGC 667 
Asp Glu Ala Asp Glu Ala Gly Lys Arg lie Phe Gly Pro Arg Ala Gly 
55 60 65 70 

35AGT GAG CTC TGT GAG GTA AAG CAT GTC CTT GAT CTC TGT CGG ATT CGT 715 
Ser Glu Leu Cys Glu Val Lys His Val Leu Asp Leu Cys Arg He Arg 
75 80 85 

GAG TCT GTG AGC GAA GAG CTT CTA CAG CTC GAA GCC AAG CGG CAG GAG 763 
4061u Ser Val Ser Glu Glu Leu Leu Gin Leu Glu Ala Lys Arg Gin Glu 
90 95 100 

CTG AAC AGC GAG ATT GCC AAG CTG AAC CTC AAG ATT GAA GCC TGT AAG 811 
Leu Asn Ser Glu He Ala Lys Leu Asn Leu Lys He Glu Ala Cys Lys 
45 105 110 115 

AAG AGC ATA GAG AAT GCC AAG CAG GAC CTG CTG CAG CTC AAG AAT GTC 859 

Lys Ser He Glu Asn Ala Lys Gin Asp Leu Leu Gin Leu Lys Asn Val 
120 125 130 

50 

ATT AGC CAG ACA GAG CAC TCC TAC AAG GAG CTG ATG GCC CAG AAC CAG 907 

He Ser Gin Thr Glu His Ser Tyr Lys Glu Leu Met Ala Gin Asn Gin 
135 140 145 150 

SSCCC TUtfV CTG TCC CTG CCC ATC CGA CTG CTC CCT GAG AAG GAC GAT GCC 955 
Pro Lys Leu Ser Leu Pro He Arg Leu Leu Pro Glu Lys Asp Asp Ala 
155 160 165 . 

GGC CTT CCA CCC CCC AAG GTC ACT CGG GGT TGC CGC CTT CAC AAC TGC 1003 
60Gly Leu Pro Pro Pro Lys Val Thr Arg Gly Cys Arg Leu His Asn Cys 
170 175 180 



TTT GAT TAC TCT CGT TGT CCT CTG ACG TCT GGC TTT CCC GTC TAC GTC 



1051 
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Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser Gly Phe Pro Val Tyr Val 
185 190 195 

TAT GAC AGT GAC CAG TTT GCC TTT GGG AGC TAC CTG GAC CCT TTG GTC 1099 
5Tyr Asp Ser Asp Gin Phe Ala Phe Gly Ser Tyr Leu Asp Pro Leu Val 
200 205 210 

AAG CAG OCT TTT CAG GCT ACA GTG AGA GCC AAC GTT TAT GTT ACA GAA 1147 
Lys Gin Ala Phe Gin Ala Thr Val Arg Ala Asn Val Tyr Val Thr Glu 
10215 220 225 230 

AAT GCG GCC ATC GCC TGC CTG TAT GTG GTG TTA GTG GGA GAA ATG CAA 1195 
Asn Ala. Ala lie Ala Cys Leu Tyr Val Val Leu Val Gly Glu Met Gin 
235 240 245 

15 

GAG CCC ACT GTG CTG CGG CCT GCC GAC CTT GAA AAG CAG CTG TTT TCT 1243 
Glu Pro Thr Val Leu Arg Pro Ala Asp Leu Glu Lys Gin Leu Phe Ser 
250 255 260 

20CTG CCA CAC TGG AGG ACA GAT GGG CAC AAC CAC GTC ATT ATC AAC CTG 1291 
Leu Pro His Trp Arg Thr Asp Gly His Asn His Val lie lie Asn Leu 
265 270 275 

TCC CGG AAG TCA GAC ACA CAG AAT CTA CTG TAC AAC GTC AGT ACA GGC 1339 
25ser Arg Lys Ser Asp Thr Gin Asn Leu Leu Tyr Asn Val Ser Thr Gly 
280 285 290 

CGC CAT GTG GCC CAG TCC ACC CTC TAT GCT GCC CAG TAC AGA GCT GGC 1387 
Arg His Val Ala Gin Ser Thr Leu Tyr Ala Ala Gin Tyr Arg Ala Gly 
30295 300 305 310 

TTT GAC CTG GTC GTG TCA CCC CTT GTC CAT GCT ATG TCT GAA CCC AAC 1435 

Phe Asp Leu Val Val Ser Pro Leu Val His Ala Met ser Glu Pro Asn 

315 320 325 

35 

TTC MG GAA ATC CCA CCG CAG GTG CCA GTT AAG CGG AAA TAT CTC TTC 1483 

Phe Met Glu lie Pro Pro Gin Val Pro Val Lys Arg Lys Tyr Leu Phe 
330 335 340 

40ACT TTC CAG GGC GAG AAG ATC GAG TCT CTG AGA TCT AGC CTT CAG GAG 1531 
Thr Phe Gin Gly Glu Lys lie Glu Ser Leu Arg Ser Ser Leu Gin Glu 
345 350 355 

GCC CGT TCC TTC GAG GAA GAG ATG GAG GGC GAC CCT CCG GCC GAC TAT 1579 
45Ala Arg Ser Phe Glu Glu Glu Met Glu Gly Asp Pro Pro Ala Asp Tyr 
360 365 370 

GAC GAT CGC ATC ATT GCC ACC CTA AAG GCT GTA CAG GAC AGC 7KAG CTG 1627 
Asp TVsp Arg lie lie Ala Thr Leu Lys Ala Val Gin Asp Ser Lys Leu 
50375 380 385 390 

GAT CAG GTG CTG GTA GAA TTC ACT TGC AAA AAC CAG CCG AAG CCT AGC 1675 
Asp Gin Val Leu Val Glu Phe Thr Cys Lys Asn Gin Pro Lys Pro Ser 
395 400 405 

55 

CTG CCG ACT GAG TGG GCA CTG TGT GGG GAG CGG GAA GAC CGC CTG GAG 1723 
Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu Arg Glu Asp Arg Leu Glu 
410 415 420 

60TTA CTG RAG CTC TCC ACC TTC GCC CTC ATC ATC ACT CCC GGG GAC CCG 1771 
Leu Leu Lys Leu Ser Thr Phe Ala Leu He He Thr Pro Gly Asp Pro 
425 430 435 
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CGC CTG CTC ATT TCA TCT GGG TGT GCC ACG CGG CTC TTC GAG GCC CTG 1819 
Arg Leu Leu lie Ser Ser Gly Cys Ala Thr Arg Leu Phe Glu Ala Leu 
440 445 450 

5GA6 GTG GGG GCC GTG CCG GTG GT6 CTC GGG GAG CAG GTG GAG CTC CCG 1867 
Glu Val Gly Ala Val Pro Val Val Leu Gly Glu Gin Val Gin Leu Pro 
455 460 465 470 

TAC CAC GAC ATG CTG CAG TGG AAC GAG GCC GCC CTG GTG GTG CCC AAG 1915 
lOTyr His Asp Met Leu Gin Trp Asn Glu Ala Ala Leu Val Val Pro Lys 
475 480 485 

CCT CGC GTC ACA GAG GTC CAC TTC CTG TTA CGA AGT CTT TCA GAC AGT 1963 
Pro Arg Val Thr Glu Val His Phe Leu Leu Arg Ser Leu Ser Asp Ser 
15 490 495 500 

GAT CTG TTG GCC ATG AGG CGG CAA GGC CGC TTT CTC TGG GAG ACC TAC 2011 

Asp Leu Leu Ala Met Arg Arg Gin Gly Arg Phe Leu Trp Glu Thr Tyr 
505 510 515 

20 

TTC TCC ACC GCA GAC AGT ATT TTT AAT ACC GTG CTG GCC ATG ATT AGG 2059 

Phe Ser Thr Ala Asp Ser He Phe Asn Thr Val Leu Ala Met He Arg 

520 525 530 

25ACT CGA ATT CAG ATC CCA GCT GCT CCC ATC CGG GAA GAG GTA GCG GCT 2107 
Thr Arg He Gin He Pro Ala Ala Pro He Arg Glu Glu Val Ala Ala 
535 540 545 550 

GAG ATC CCC CAT CGT TCA GGC 7\AA GCA GCT GGA ACT GAC CCC AAC ATG 2155 
30g1u He Pro His Arg Ser Gly Lys Ala Ala Gly Thr Asp Pro Asn Met 
555 560 565 

GCT GAC AAT GGG GAC CTG GAC CTG GGG CCG GTA GAG ACA GAA CCA CCC 2203 
Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro Val Glu Thr Glu Pro Pro 
35 570 575 580 

TAT GCC TCA CCT AAA TAC CTC CGC AAT TTC ACT CTG ACT GTC ACA GAC 2251 
Tyr Ala Ser Pro Lys Tyr Leu Arg Asn Phe Thr Leu Thr Val Thr Asp 
585 590 595 

40 

TGT TAC CGT GGC TGG AAC TCT GCC CCG GGA CGG TTC CAT CTT TTT CCC 2299 
Cys Tyr Arg Gly Trp Asn Ser Ala Pro Gly Arg Phe His Leu Phe Pro 
600 605 610 

45CAC ACA CCC TTT GAT CCT GTG TTG CCC TCT GAG GCC AAA TTC TTG GGC 2347 
His Thr Pro Phe Asp Pro Val Leu Pro Ser Glu Ala Lys Phe Leu Gly 
615 620 625 630 

TCA GGG ACT GGA TTT CGG CCG ATC GGT GGC GGG GCT GGG GGC TCT GGC 2395 
50Ser Gly Thr Gly Phe Arg Pro He Gly Gly Gly Ala Gly Gly Ser Gly 
635 640 645 

AAG GAG TTC CAG GCA GCG CTC GGA GGC AAT GTC CAG CGG GAG CAG TTC 2443 
Lys Glu Phe Gin Ala Ala Leu Gly Gly Asn Val Gin Arg Glu Gin Phe 
55 650 655 660 

ACA GTT GTG ATG CTG ACC TAC GAG CGG GAG GAA GTG CTC ATG AAC TCC 2491 
Thr Val Val Met Leu Thr Tyr Glu Arg Glu Glu Val Leu Met Asn Ser 
665 670 675 

60 

CTG GAG AGA CTC AAC GGC CTC CCC TAC CTG AAC AAG GTA GTG GTG GTG 2539 
Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu Asn Lys Val Val Val Val 
680 685 690 
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TGG AAC TCT CCC AAG CTG CCC TCG GAG GAG CTT TTG TGG CCA GAC ATT 2587 
Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp Leu Leu Trp Pro Asp lie 
695 700 705 710 

5GGT GTC CCC ATC ATG GTC GTC CGT ACT GAG AAG AAC AGT TTG AAC AAT 2635 
Gly Val Pro ile Met Val Val Arg Thr Glu Lys Asn Ser Leu Asn Asn 
715 720 725 

CGG TTC TTG CCC TGG AAT GAG ATT GAG ACA GAG GCC ATA CTG TCC ATC 2683 
10 Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr Glu Ala Ile Leu Ser Ile 
730 735 740 

GAC GAT GAT GCT CAC CTC CGC CAT GAT GAA ATC ATG TTT GGG TTT TGG 2731 
Asp Asp Asp Ala His Leu Arg His Asp Glu lie Met Phe Gly Phe Trp 
15 745 750 . 755 

GTG TGG AGA GAA GCA CGT GAT CGC ATT GTG GGT TTC CCT GGC CGG TAC 2779 
Val Trp Arg Glu Ala Arg Asp Arg lie Val Gly Phe Pro Gly TVrg Tyr 
760 765 770 

20 

CAT GCG TGG GAC ATC CCG CAC CAG TCC TGG CTC TAC AAT TCC AAC TAC 2827 
His Ala Trp Asp lie Pro His Gin Ser Trp Leu Tyr Asn Ser Asn Tyr 
775 780 785 790 

25TCC TGT GAG CTG TCC ATG GTG CTG ACG GGC GCT GCC TTC TTT CAC AAG 2875 
Ser Cys Glu Leu Ser Met Val Leu Thr Gly Ala Ala Phe Phe His Lys 
795 800 805 

TAT TAT GCC TAC CTG TAT TCT TAT GTG ATG CCC CAG GCC ATC CGG GAC 2923 
30Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met Pro Gin Ala lie Arg Asp 
810 815 820 

ATG GTG GAC GAG TAC ATC AAC TGT GAG GAT ATC GCC ATG AAC TTC CTT 2971 
Met Val Asp Glu Tyr lie Asn Cys Glu Asp lie Ala Met TVsn Phe Leu 
35 825 830 835 

GTC TCC CAC ATC ACA CGG AAA CCC CCC ATC AAG GTG ACA TCA AGG TGG 3019 

Val Ser His lie Thr Arg Lys Pro Pro lie Lys Val Thr Ser Arg Trp 

840 845 850 

40 

ACT TTT C6A TGC CCA GGG TGC CCT CAG GCC CTG TCC CAT GAT GAC TCT 3067 

Thr Phe Arg Cys Pro Gly Cys Pro Gin Ala Leu Ser His Asp Asp Ser 

855 860 865 870 

45CAT TTT CAC GAG CGG CAC AAG TGT ATC AAC TTT TTT GTC AAG GTG TAC 3115 
His Phe His Glu Arg His Lys Cys lie Asn Phe Phe Val Lys Val Tyr 
875 880 885 

GGC TAT ATG CCT CTC TTG TAC ACA CAG TTC AGG GTG GAC TCC GTG CTC 3163 
SOGly Tyr Met Pro Leu Leu Tyr Thr Gin Phe Arg Val Asp Ser Val Leu 
890 895 900 

TTC AAG ACC CGC CTG CCC CAT GAC AAG ACC AAG TGC TTC AAG TTC ATC 3211 
Phe Lys Thr Arg Leu Pro His Asp Lys Thr Lys Cys Phe Lys Phe lie 
55 905 910 915 

TAGGGCCTTG CAGTTCTGAG G9VGACAATGA GCAGAGCGAG GGGGAGTCAC CCTCAAGGTT 3271 

CCCAAGGTGT CGAAGGTCCT TGGGGACATC TGTCGGGCAG GGCCAAGACC CTTTGCTGGG 3331 

60 

AGAGGCAGCA GGAAGAGTGG ATUVGGGATAG CTGTCTTTCA TTTTGAAGTC AGCCACACTG 3391 



GGCCTGGGAT CCTGGTCAGA GACTCAGGNC GTCTGCACAG 



GGCACTGACT GATAGCGAAC 3451 
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ACTGAGGACT GTTCATAAGC CCAGGACA 3479 
(2) INFORMATION FOR SEQ ID NO: 2: 

5 

(i) SEQUENCE CHARACTERISTICS: i 

(A) LENGTH: 918 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: protein 

(xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

15Met Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Val Gly Asn Gly Gly 
15 10 15 

Gin Thr Cys Met Leu Arg Trp Ser Asn Arg lie Arg Leu Thr Trp Leu 
20 25 30 

20 

Ser Phe Thr Leu Phe lie lie Leu Val Phe Phe Pro Leu He Ala His 
35 40 45 

Tyr Tyr Leu Thr Thr Leu Asp Glu 7U.a Asp Glu Ala Gly Lys Arg He 
25 . 50 . 55 60 

Phe Gly Pro Arg Ala Gly Ser Glu Leu Cys Glu Val Lys His Val leu 
65 70 75 80 

30Asp Leu Cys Arg He Arg Glu Ser Val Ser Glu Glu Leu Leu Gin Leu 
85 90 95 

Glu Ala Lys Arg Gin Glu Leu Asn Ser Glu He Ala Lys Leu Asn Leu 
100 105 110 

35 

Lys He Glu Ala Cys Lys Lys Ser He Glu Asn Ala Lys Gin Asp Leu 
115 120 125 

Leu Gin Leu Lys Asn Val He Ser Gin Thr Glu His Ser Tyr Lys Glu 
40 130 135 140 

Leu Met 7U.a Gin Asn Gin Pro Lys Leu Ser Leu Pro He Arg Leu Leu 
145 150 155 160 

4 5 Pro Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Val Thr Arg Gly 
165 170 175 

Cys Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser 
180 185 190 

50 

Gly Phe Pro Val Tyr Val Tyr Asp Ser Asp Gin Phe Ala Phe Gly Ser 
195 200 205 

Tyr Leu Asp Pro Leu Val Lys Gin Ala Phe Gin Ala Thr Val 7U:g Ala 
55 210 215 220 

Asn Val Tyr Val Thr Glu Asn Ala Ala He Ala Cys Leu Tyr Val Val 
225 230 235 240 

60 Leu Val Gly Glu Met Gin Glu Pro Thr Val Leu Arg Pro Ala Asp Leu 
245 250 255 

Glu Lys Gin Leu Phe Ser Leu Pro His Trp Arg Thr Asp Gly His Asn 



I 
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260 265 270 

His Val He He Asn Leu Ser Arg Lys Ser Asp Thr Gin Asn Leu Leu 
275 280 285 

5 

Tyr Asn Val Ser Thr Gly Arg His Val Ala Gin Ser Thr Leu Tyr Ala 
290 295 300 

Ala Gin Tyr Arg Ala Gly Phe Asp Leu Val Val Ser Pro Leu Val His 
10305 310 315 320 

Ala Met Ser Glu Pro Asn Phe Met Glu He Pro Pro Gin Val Pro Val 
325 330 335 

15Lys Arg Lys Tyr Leu Phe Thr Phe Gin Gly Glu Lys He Glu Ser Leu 
340 345 350 

Arg Ser Ser Leu Gin Glu Ala Arg Ser Phe Glu Glu Glu Met Glu Gly 
355 360 365 

20 

Asp Pro Pro Ala Asp Tyr Asp Asp Arg He He Ala Thr Leu Lys Ala 
370 375 380 

Val Gin Asp Ser Lys Leu Asp Gin Val Leu Val Glu Phe Thr Cys Lys 
25385 390 395 400 

Asn Gin Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu 
405 410 415 

BOArg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu He 
420 425 430 

He Thr Pro Gly i\sp Pro Arg Leu Leu He Ser Ser Gly Cys Ala Thr 
435 440 445 

35 

Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu Gly 
450 455 460 

Glu Gin Val Gin Leu Pro Tyr His Asp Met Leu Gin Trp Asn Glu Ala 
40465 470 475 480 

Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu Leu 
485 490 495 

45Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gin Gly Arg 
500 505 510 

Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser He Phe Asn Thr 
515 520 525 

50 

Val Leu Ala Met He Arg Thr Arg He Gin He Pro Ala Ala Pro He 
530 535 540 

Arg Glu Glu Val Ala Ala Glu He Pro His Arg Ser Gly Lys Ala Ala 
55545 550 555 560 

Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro 
565 570 575 

6 Oval Glu Thr Glu Pro Pro Tyr Ala Ser Pro Lys Tyr Leu Arg Asn Phe 
580 585 590 

Thr Leu Thr Val Thr Asp Cys Tyr Arg Gly Trp Asn Ser Ala Pro Gly 
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595 600 605 

Arg Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro Ser 
610 615 620 

5 

Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro lie Gly Gly 
625 630 635 640 

Gly Ala Gly Gly Ser Gly Lys Glu Phe Gin Ala Ala Leu Gly Gly Asn 
10 645 650 655 

Val Gin Arg Glu Gin Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu 
660 665 670 

ISgIu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu 
675 680 685 

Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp 
690 695 700 

20 

Leu Leu Trp Pro Asp He Gly Val Pro He Met Val Val Arg Thr Glu 
705 710 715 720 

Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu He Glu Thr 
25 725 730 735 

Glu Ala He Leu Ser He Asp Asp Asp Ala His Leu Arg His Asp Glu 
740 745 750 

BOile Met Phe Gly Phe Trp Val Trp Arg Glu Ala Arg Asp Arg He Val 

755 760 765 

Gly Phe Pro Gly TVrg Tyr His Ala Trp Asp He Pro His Gin Ser Trp 
770 775 780 

35 

Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly 
785 790 795 800 

Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met 
40 805 810 815 

Pro Gin Ala He Arg Asp Met Val Asp Glu Tyr He Asn Cys Glu Asp 
820 825 830 

45lle Ala Met Asn Phe Leu Val Ser His He Thr Arg Lys Pro Pro He 
835 840 845 

Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gin Ala 
850 855 860 

50 

Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys He Asn 
865 870 875 880 

Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gin Phe 
55 885 890 895 

Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr 
900 905 910 

60Lys Cys Phe Lys Phe He 
915 

(2) INFOKMATION FOR SEQ ID NO: 3: 



wo 00^8959 



FCr/US99ni6S4 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6172 base pairs 

(B) TYPE: nucleic acid 

(C) STR7WDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

10 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 594.. 3350 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGCGGGTCCC TGAGCTGGAA GCC6GAGAGC AA6CCCTGGA GGTTCACTCT TTCAAGAACT 60 

CGT6TGCTGA GGTGTAATGC TACACAAGTC AGAGGAAGGA A6G6TCCTGA AACACATGGC 120 

20 

CTGATTGTTG GCAAAGGCAT CATAAGAAGC TGGCATTTAT TTCTGTTCTA ACCTATTACT 180 

GTATAACTGT (3J\TAGACAC TATGCATATT TGTTGGTCAG CAAAACCAAG AAACAAGAGC 240 

25TATGGCATTT GAAAAAGTCT GTCTGATTCC AGGGTGTTTT TCCTGGGTTT CATCATCAGG 300 

TACCTCCTCC CTTTCATCTC AGCAAGAATG TGGCACCTTT TATCGTTTGA TAAAGATTAA 360 

GGACATGTTC TTTGGTCAAC AGCCAGAACT TAAAATCTGC TGGAATAGGG TCAGAGACCA 420 

30 

TTTCAGCTGC AGCTGAGGAA AATGAAATGT TCATTTTATT TGGTGCCTTG TCTGG<?GAGC 480 

ACACTAACTC TTCTGGAAAC GTGTCAGTGA 7\ACAGAGATC GTTTTGTGGA ATAGCAACCC 540 

35ATGGTTATGG CGAGTGACCC GACGTGATCT GGGGGGCAGG CTGCAGAGGA CTC ATG 596 

Met 

ACA GGC TAT ACC ATG CTG CGG AAT 66G GGC GCG GGG AAC GGA GGT CAG 644 
40Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Ala Gly Asn Gly Gly Gin 
920 925 930 935 

ACC TGC ATG CTG CGC TGG TCC AAC CGC ATC CGC CTC ACG TGG CTC AGC 692 
Thr Cys Met Leu Arg Trp Ser Asn Arg He Arg Leu Thr Trp Leu Ser 
45 940 945 950 

TTC ACG CTC TTT GTC ATC CTG GTC TTC TTC CCG CTC ATC GCC CAC TAT 740 

Phe Thr Leu Phe Val He Leu Val Phe Phe Pro Leu He Ala His Tyr 
955 960 965 

50 

TAC CTC ACC ACT CTG GAT GAG GCT G3VT GAG GCA GGC AAG CGG ATT TTT 788 

Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg He Phe 
970 975 980 

55GGT CCC CGG GTG GGG AAC GAG CTG TGC GAG GTG AAG CAC GTG CTG GAT 836 
Gly Pro Arg Val Gly Asn Glu Leu Cys Glu Val Lys His Val Leu Asp 
985 990 995 

CTG TGC CGC ATC CGG GAG TCG GTG AGT GAA GAG CTC CTG CAG CTG GAG 884 
60Leu Cys Arg He Arg Glu Ser Val Ser Glu Glu Leu Leu Gin Leu Glu 
1000 1005 1010 1015 

GCC AAG CGC CAA GAG CTG AAC AGC GAG ATC GCC AAG CTG AAT CTG AAG 932 
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Ala Lys Arg Gin Glu Leu Asn Ser Glu lie Ala Lys Ijeu Asn Leu Lys 
1020 1025 1030 

ATC GAA GCC TGT. AAG AAG AGC ATT GAG AAC GCC AAG CAG GAG CTG CTC 980 
5lle Glu Ala Cys Lys Lys Ser lie Glu Asn Ala Lys Gin Asp Leu Leu 
1035 1040 1045 

CAG CTC AAG AAT GTC ATC AGC CAG ACC GAG CAT TCC TAC AAG GAG CTC 1028 
Gin Leu Lys Asn Val lie Ser Gin Thr Glu His Ser Tyr Lys Glu Leu 
10 1050 1055 1060 

ATG GCC CAG AAC CAG CCC AAG CTG TCC CTG CCC ATC CGA CTG CTC CCA 1076 

Met Ala Gin Asn Gin Pro Lys Leu Ser Leu Pro lie Arg Leu Leu Pro 
1065 1070 1075 

15 

GAG AAG GAC GAT GCC GGC CTC CCT CCC CCG AAG GCC ACT CGG GGC TGC 1124 

Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Ala Thr Arg Giy Cys 
1080 1085 1090 1095 

20CGG CTA CAC AAC TGC TTT GAT TAT TCT CGT TGC CCT CTC ACC TCT GGC 1172 
Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser Gly 
1100 1105 1110 

TTC CCG GTC TAC GTC TAT GAC AGT GAC CAG TTT GTC TTT GGC AGC TAC 1220 
25Phe Pro Val Tyr Val Tyr Asp Ser Asp Gin Phe Val Phe Gly Ser Tyr 
.1115 1120 1125 

CTG GAT CCC TTG GTC AAG CAG GCT TTT CAG GCG ACA GCA CGA GCT AAC 1268 
Leu Asp Pro Leu Val Lys Gin Ala Phe Gin Ala Thr Ala Arg Ala Asn 
30 1130 1135 1140 

GTT TAT GTT ACA GAA AAT GCA GAC ATC GCC TGC CTT TAC GTG ATA CTA 1316 
Val Tyr Val Thr Glu Asn Ala Asp lie Ala Cys Leu Tyr Val He Leu 
1145 1150 1155 

35 

GTG GGA GAG ATG CAG GAG CCC GTG GTG CTG CGG CCT GCT GAG CTG GAG 1364 
Val Gly Glu Met Gin Glu Pro Val Val Leu Arg Pro Ala Glu Leu Glu 
1160 1165 1170 1175 

40AAG CAG TTG TAT TCC CTG CCA CAC TGG CGG ACG GAT GGA CAC AAC CAT 1412 
Lys Gin Leu Tyr Ser Leu Pro His Trp Arg Thr Asp Gly His Asn His 
1180 1185 1190 

GTC ATC ATC AAT CTG TCA CGT AAG TCA GAT ACA CAG AAC CTT CTC TAT 1460 
45val He lie Asn Leu Ser Arg Lys Ser Asp Thr Gin Asn Leu Leu Tyr 
1195 1200 1205 

7UVC GTC AGT ACT GGC CGT GCC ATG GTG GCC CAG TCC ACC TTC TAC ACT 1508 
Asn Val Ser Thr Gly Arg Ala Met Val Ala Gin Ser Thr Phe Tyr Thr 
50 1210 1215 1220 

GTC CAG TAC AGA CCT GGC TTT GAC TTG GTC GTA TCA CCG CTG GTC CAT 1556 
Val Gin Tyr Arg Pro Gly Phe Asp Leu Val Val Ser Pro Leu Val His 
1225 1230 1235 

55 

GCC ATG TCT GAG CCC AAC TTC ATG GAA ATC CCA CCA CAG GTG CCG GTG 1604 
Ala Met Ser Glu Pro Asn Phe Met Glu He Pro Pro Gin Val Pro Val 
1240 1245 1250 1255 

60AAG CGG AAA TAT CTC TTC ACC TTC CAG GGC GAG AAG ATT GAG TCT CTG 1652 
Lys Arg Lys Tyr Leu Phe Thr phe Gin Gly Glu Lys He Glu Ser Leu 
1260 1265 1270 
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AGG 
Arg 


TCT 
Ser 


AGC 
Ser 


CTT CAG 
Leu Gin 
1275 


GAG GCC 
Glu Ala 


CGC TCC TTC 
Arg. Ser Phe 
1280 


GAA 
Glu 


GAG 
Glu 


GAA ATG GAG 
Glu Met Glu 
1285 


GGC 
Gly 


1700 


5gac 
Asp 


OCT 
Pro 


CCC GCC 
Pro Ala 
1290 


GAC TAC GAT 
Asp Tyr Asp 


GAC CGG 
Asp Arg 
1295 


ATC 
He 


ATT GCC 
He Ala 


ACC CTG 
Thr Leu 
1300 


AAG GCG 
Lys Ala 


1748 


GTG 
lOval 


CAG GAC 
Gin Asp 
1305 


AGC 
Ser 


AAG 
Lys 


CTG GAT CAG 
Leu Asp Gin 
1310 


GTC 
Val 


CTG 
Leu 


GTG 
val 


GAA TTC 
Glu Phe 
1315 


ACC 
Thr 


TGC AAA 
Cys Lys 


1796 


AAC CAG 
Asn Gin 
151320 


CCC 
Pro 


AAA 
Lys 


CCC 
Pro 


AGC CTG 
Ser Leu 
1325 


CCG 
Pro 


ACT 
Thr 


GAG 
Glu 


TGG GCA 
Trp Ala 
1330 


CTG 
Leu 


TGT 
Cys 


GGA GAG 
Gly Glu 
1335 


1844 


CGG 
Arg 

20 


GAG 
Glu 


GAC 
Asp 


CGC 
Arg 


TTG GAA 
Leu Glu 
1340 


TTG 
Leu 


CTG 
Leu 


AAG 
Lys 


CTC TCC 
Leu Ser 
1345 


ACC 
Thr 


TTC 
Phe 


GCC 
Ala 


CTC ATC 
Leu He 
1350 


1892 


ATT 
He 


ACC 
Thr 


CCC 

Pro 


GGG GAC 
Gly Asp 
1355 


CCT 

Pro 


CGC 
Arg 


TTG 
Leu 


GTT ATT 
Val He 
1360 


TCC 
Ser 


TCT 
Ser 


GGG 
Gly 


TGT GCA 
Cys Ala 
1365 


ACA 
Thr 


1940 


25CGG 
Arg 


CTC 
Leu 


TTC GAA 
Phe Glu 
1370 


GCC 
Ala 


CTG 
Leu 


GAA 
Glu 


GTC GGT 
Val Gly 
1375 


GCC 
Ala 


GTC 
Val 


CCG 
Pro 


GTG GTG 
Val Val 
1380 


CTG 
Leu 


GGG 
Gly 


1988 


GAG 
30G1U 


CAG GTC 
Gin Val 
1385 


CAG 
Gin 


CTT 
Leu 


CCC 

Pro 


TAC CAG 
Tyr Gin 
1390 


GAC 
J\sp 


ATG 
Met 


CTG 
Leu 


CAG TGG 
Gin Trp 
1395 


AAC 
Asn 


GAG 
Glu 


GCG 
Ala 


2036 


GCC CTG 
Ala Leu 
351400 


GTG 
Val 


GTG 
Val 


CCA 
Pro 


AAG CCT 

Lys Pro 
1405 


CGT 
Arg 


GTT 
Val 


ACC 
Thr 


GAG GTT 
Glu Val 
1410 


CAT 
His 


TTC 
Phe 


CTG 
Leu 


CTC 
Leu 
1415 


2084 


AGA 
Arg 

40 


AGC 
Ser 


CTC 
Leu 


TCC 
Ser 


GAT AGT 
Asp Ser 
1420 


GAC 
Asp 


CTC 
Leu 


CTG 
Leu 


GCT ATG 
Ala Met 
1425 


AGG 
Arg 


CGG 
Arg 


CAA 
Gin 


GGC CGC 
Gly Arg 
1430 


2132 


TTT 
Phe 


CTC 
Leu 


TGG 
Trp 


GAG ACT 
Glu Thr 
1435 


TAC 
Tyr 


TTC 
Phe 


TCC 
Ser 


ACT GCT 
Thr Ala 
1440 


GAC 
Asp 


AGT 
Ser 


ATT 
He 


TTT AAT 
Phe Asn 
1445 


ACC 
Thr 


2180 


45GTG 
Val 


CTG 
Leu 


GCT ATG 
Ala Met 
1450 


ATT 
He 


AGG 
Arg 


ACT 
Thr 


CGC ATC 
Arg He 
1455 


CAG 
Gin 


ATC 
He 


CCA 
Pro 


GCC GCT 
Ala Ala 
1460 


CCC 
Pro 


ATC 
He 


2228 


CGG 
SOArg 


GAA GAG 
Glu Glu 
1465 


GCG 
Ala 


GCA 
Ala 


GCT 
Ala 


GAG ATC 
Glu He 
1470 


CCC 
Pro 


CAC 
His 


CGT 
Arg 


TCA GGC 
Ser Gly 
1475 


AAG 
Lys 


GCG 
Ala 


GCT 
Ala 


2276 


GGA ACT GAC 
Gly Thr Asp 
551480 


CCC 
Pro 


AAC 

Asn 


ATG GCT GAC 
Met Ala Asp 
1485 


AAC 
Asn 


GGG 
Gly 


GAC CTG GAC 
Asp Leu Asp 
1490 


CTG 
Leu 


GGG 
Gly 


CCA 
Pro 
1495 


2324 


GTG 
Val 


GAG 
Glu 


ACG 
Thr 


GAG 
Glu 


CCG 
Pro 


CCC TAC GCC 
Pro Tyr Ala 


TCA 
Ser 


CCC 
Pro 


AGA TAC CTC 
Arg Tyr Leu 


CGC 
Arg 


AAT 
Asn 


TTC 
Phe 


2372 



1500 1505 1510 

60 



ACT CTG ACT GTC ACT GAC TTT TAC CGC AGC TGG AAC TGT GCT CCA GGG 2420 
Thr Leu Thr Val Thr Asp Phe Tyr Arg Ser Trp Asn Cys Ala Pro Gly 
1515 1520 1525 
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CCT 


TTC 


CAT 


CTT 


TTC 


CCC 


CAC 


ACT 


CCC 


TTT 


GAC 


CCT 


GTG 


TTG 


CCC 


TCA 


2468 


Pro 


Phe 


His 


Leu 


Phe 


Pro 


His 


Thr 


Pro 


Phe 


Asp 


Pro Val 


Leu 


Pro 


Ser 








1530 








1535 








1540 








5G7VG GCC AAA 


TTC 


TTG 


GGC 


TCA 


GGG 


ACT 


GGC 


TTT 


CGG 


CCT 


ATT 


GGT 


GGT 


2516 


Glu Ala 


Lys 


Phe 


Leu 


Gly Ser Gly 


Thr Gly 


Phe Arg 


Pro 


He Gly Gly 






1545 








1550 








1555 










GGA 


GCT 


GGG 


GGT 


TCT 


GGC 


AAG 


GAA 


TTT 


CAG 


GCA 


GCG 


CTT 


GGA 


GGC 


RAT 


2564 


lOGly Ala 


Gly 


Gly 


Ser 


Gly Lys 


Glu 


Phe 


Gin 


Ala 


Ala 


Leu 


Gly Gly Asn 




1560 








1565 








1570 








1575 




GTT 


CCC 


CGA 


GAG 


CAG 


TTC 


ACG 


GTG 


GTG ATG 


TTG 


ACT 


TAT 


GAG 


CGG 


GAG 


2612 


Val 


Pro 


Arg 


Glu 


Gin 


Phe 


Thr 


Val 


Val 


Met 


Leu 


Thr 


Tyr 


Glu Arg Glu 




15 






1580 








1585 








1590 




GAA 


GTG 


CTT 


ATG 


AAC 


TCT 


TTA 


GAG 


AGG 


CTG 


AAT 


GGC 


CTC 


CCT 


TAC 


CTG 


2660 


Glu 


Val 


Leu 


Met 


Asn 


Ser 


Leu 


Glu 


Arg 


Leu 


Asn 


Gly 


Leu 


Pro Tyr 


Leu 










1595 








1600 








1605 






20 


































AAC 


AAG 


GTC 


GTG 


GTG 


GTG 


TGG 


AAT 


TCT 


CCC 


AAG 


CTG 


CCA 


TCA 


GAG 


GAC 


2708 


Asn 


Lys 


Val 


Val 


Val 


Val 


Trp 


Asn 


Ser 


Pro 


Lys 


Leu 


Pro 


Ser 


Glu 


Asp 






1610 








1615 








1620 








25CTT 


CTG 


TGG 


CCT 


GAG 


ATT 


GGC 


GTT 


CCC 


ATC 


ATG 


GTG 


GTC 


CGT 


ACT 


GAG 


2756 


Leu 


Leu 


Trp 


Pro 


Asp 


He 


Gly 


Val 


Pro 


He 


Met 


Val 


Val 


Arg 


Thr 


Glu 






1625 








1630 








1635 










AAG 


AAC 


AGT 


TTG 


AAC 


AAC 


CGA 


TTC 


TTA 


CCC 


TGG 


AAT 


GAA 


ATT 


GAG 


ACA 


2804 


30Lys 


Asn 


ser 


Leu 


Asn 


Asn 


Arg 


Phe 


Leu 


Pro 


Trp 


Asn 


Glu 


He 


Glu 


Thr 




1640 








1645 








1650 








1655 




GAG 


GCC 


ATC 


CTG 


TCC 


ATT 


GAT 


GAC 


GAT 


GCT 


CAC 


CTC 


CGC 


CAT 


GAC 


GAA 


2852 


Glu 


Ala 


He 


Leu 


Ser 


He 


Asp 


Asp 


Asp 


Ala 


His 


Leu 


Arg 


His 


Asp 


Glu 




35 








1660 








1665 








1670 




ATC 


ATG 


TTT 


GGG 


TTC 


CGG 


GTG 


TGG 


AGA 


GAA 


GCT 


CGG 


GAC 


CGC 


ATC 


GTG 


2900 


He 


Met 


Phe 


Gly 


Phe 


Arg 


Val 


Trp 


TVrg 


Glu 


Ala 


Arg 


Asp 


Arg 


He 


Val 










1675 








1680 








1685 






40 


































GGC 


TTC 


CCT 


GGC 


CGT 


TAC 


CAC 


GCA 


TGG 


GAC 


ATC 


CCC 


CAT 


CAG 


TCC 


TGG 


2948 


Gly 


Phe 


Pro 


Gly 


Arg 


•Pyr 


His 


Ala 


Trp 


Asp 


He 


Pro 


His 


Gin 


Ser 


Trp 






1690 








1695 








1700 








45CTC 


TAC 


AAC 


TCC 


AAC 


TAC 


TCC 


TGT 


GAG 


CTG 


TCC 


ATG 


GTG 


CTG 


ACA 


GGT 


2996 


Leu 


Tyr 


Asn 


Ser 


Asn 


Tyr 


Ser 


Cys 


Glu 


Leu 


Ser 


Met 


Val 


Leu 


Thr 


Gly 






1705 








1710 








1715 










GCT 


GCC 


TTC 


TTT 


CAC 


AAG 


TAT 


TAT 


GCC 


TAC 


CTG 


TAT 


TCT 


TAT 


GTG 


ATG 


3044 


50Ala 


Ala 


Phe 


Phe 


His 


Lys 


Tyr 


Tyr 


Ala 


Tyr 


Leu 


Tyr 


Ser 


Tyr 


Val 


Met 




1720 








1725 








1730 








1735 




CCC 


CAG 


GCC 


ATC 


CGG 


GAC 


ATG 


GTG 


GAT 


GAA 


TAC 


ATC 


AAC 


TGT 


GAG 


GAC 




Pro 


Gin 


Ala 


He 


Arg 


Asp Met 


Val 


Asp 


Glu 


Tyr He 


Asn 


Cys 


Glu 


Asp 




55 








1740 








1745 








1750 




ATT 


GCC 


ATG 


AAC 


TTC 


CTT 


GTC 


TCC 


CAC 


ATC 


ACT 


CGG 


AAG 


CCC 


CCC 


ATC 


3140 


He Ala 


Met 


Asn 


Phe 


Leu 


Val 


Ser 


His 


He Thr Arg 


Lys 


Pro 


Pro 


He 










1755 








1760 








1765 






60 


































AAG 


GTG 


ACC 


TCA 


CGG 


TGG 


ACA 


TTC 


CGA TGC 


CCA 


GGA 


TGC 


CCT 


CAG 


GCC 


3188 


Lys Val 


Thr 


Sier Arg Trp 


Thr 


Phe 


Arg Cys 


Pro Gly 


Cys 


Pro 


Gin Ala 





1770 1775 1780 
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CTG TCT CAT GAT GAC TCC CAC TTC CAC GAG CGG CAC AAG TGC ATC AAC 3236 
Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys lie Asn 
1785 1790 1795 

5TTC TTC GTG AAG GTG TAC GGC TAG ATG CCC CTC CTG TAG ACG CAG TTC 3284 
Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gin Phe 
1800 1805 1810 1815 

AGG GTG GAT TCT GTG CTC TTC AAG ACA CGC CTG CCC CAT GAC AAG ACC 3332 
10 Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr 
1820 1825 1830 



AAG TGC TTC AAG TTC ATC TAGGGGCAGC GCACG6TCTG GGGAAGAGGA 3380 
Lys Cys Phe Lys Phe lie 
15 1835 



TGAGCAGAGG 


GAGGAAGATG 


GCTCCCAAGG 


TTCCTAGGCA 


TTGCA6GACC 


TTGGGCACAT 


3440 


CTGCTGGTGG 

20 

CGCTGCCTTT 


GTGGCCCAGA 


GCCTCTGCTG 


GAAGGG6CAG 


CAGGAGGAGT 


GGAAGGAAAC 


3500 


ATCTTGAAGT 


CAGCCACACT 


GGGCCTGGAG 


CCCTGGGCGG 


A6TCCCCGGG 


3560 


GTTCCCCACA 


CAGGGCACTG 


ACTGATAGCT 


TACACTGAGG 


ACTGTGGCGA 


CTCTGCAGAG 


3620 


25TCACTCACAC 


CGTTCGTACG 


CCCAGGACAG 


CT6GTTCGTG 


GTTTTTACAT 


TCAATAACAA 


3680 


CTATTATGAT 


TATTTAAAAA 


GAGAAAGTTT 


CAGATTTGCC 


ATTCAAGGCT 


TATTTATATA 


3740 


TATGTGTGTG 
AGTGTGAGTT 


TATATAAATA 


CATGCACACA 


CTTGCATACA 


TATATATTTT 


TGGCTGGGGG 


3800 


TTGCCTTTCT 


AAGGGAGGGA 


CCGCGCAGGC 


TCCTTTGTTC 


TGTATTCTGG 


3860 


CGGAGTVTGGG 


TCCTGGCCTT 


GTGTCACTGG 


CTTATCCTTA 


AAGATCATCT 


CCCATCCTCC 


3920 


35CCAGCGCCAT 


CTGTGTGCAG 


CAACCAGAAA 


GGGATGAACT 


TGGCCCTCTT 


GCGGGCCTGG 


3980 


ACAAGGTCTC 


TTCCTTACCC 


TTTCTGTTGC 


CAGTCAGCAA 


CCTGTAACTC 


ACATTCTCTT 


4040 


CCCAGTGAAT 

40 

CCAGCGATTT 


CCCTGGGAGC 


GCCTGACCCT 


GGTGGGCTGT 


TCAGCTTCCT 


GCTGCTGGGG 


4100 


TTGAGGATTT 


ATCTTTAGGC 


CAGGCTTGCC 


TCCGTACTTA 


TCCCTGCTCT 


4160 


CCCATTTCTC 


TCTTGTTTGA 


GAGAGAATGA 


GGAAGCAAAG 


AGTGAGAAAG 


AATAGGGGCT 


4220 


45GAAGACGCCA 


CTCCCAGATG 


GCTCTTTCTA 


TCCTGCTCTT 


CTGTTGAAAC 


ACACGTGCTG 


4280 


TGGGCCTCAG 


GCGTTTCTGA 


AGTGrCTCTTT 


CTTGGATTGG 


ACAGGAi^TC 


AGCAGCGTGC 


4340 


ACATCTGCTG 

50 

CCAGTGTCCT 


TGGTCTGAAG 


TGGTTTGCAG 


GTCAGCCTCC 


TCTCCCTAGT 


GTAGA6CAAG 


4400 


TCGAGGAACC 


CACCCG6CTG 


GCCGGGAAGT 


TTTACAGCAA 


GGCGCCTGCC 


4460 


TTGGGATAAT 


TCCTTGGTGA 


AATTCACCTT 


CCCCCCGCCT 


CTGTCTGGAG 


CCCCATCCTG 


4520 


5 5TGTTATCTGT 


GGTTTTTGGA 


CCCCTAATGT 


CAGCTTGGCT 


GTAGGACTCC 


CCGAGGTTTG 


4580 


GTATGTGCTA 


GAACAATGGG 


AGGCTGTGAT 


TTGCTGTGTA 


AGCTCACATC 


CAGCCTTGGA 


4640 


ATCTAACGGG 

60 

CCTGGGCTGA 


CATTCACAAC 


CCGAGTTACC 


ACTTTCCACT 


CCCTGCTTAG 


GATTCTGTTC 


4700 


AACTGAAATA 


AGCTAATTTT 


TTGGGTCACG 


GTGGCAGTAG 


GGGAACCTAG 


4760 


GAGGGTGTGA 


GTGGCATTTG 


TCAGGGATTT 


AGCCCATGAC 


GTGTTTCTTG 


AACCCTACTT 


4820 
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TCTGGAAGTG 


GAGTTGACTC 


TGGAAGTTTT CTAGCAACTG AACAAAAGCT 


CAGGTTTGTC 


4880 


CTGGTCATGC 


ACATGCCTTA 


AGCCAGTTCC 


GTCTTCCCTA GACCTTGGCA TCCTGTGCTT 


4940 


BCTATTTCTTG 


GAATACGTTC 


TCCTCTGACC 


TGCCTGTACC ACGTGGGTCC 


TCTTCAAGTA 


5000 


CTGTTTTGAA 


GCTGGGCTCT 


TTTGTGTAGC 


TCCCACCCAC CTGTAGGGCT 


AGCTCGGCTT 


5060 


AAGGGAACTC 

10 

GTTCCCCGCC 


TCCCCATTGG 


CAAACCGGAC 


CCGGCCGCCG CCAGG7\CTGT 


GTTTCCA7UVG 


5120 


CCCAACCCCA 


GCATCAGCCT 


GTAGCTCCCC TGCTGAGGCA 


GTGTGGTTAT 


5180 


GTTCCC7VGCA 


GTGGGGGTCA 


GACGCCCTTC 


CTCAGAACTT TCTAGTTGCC 


CTCTACCTGA 


5240 


1 5CTCCTGACTT 


GTATTCCTTT 


TAGCAGTAGC 


CTTCTTCCCT CGGGGAGCCA AAGAGTGTGG 


5300 


TGTGTGGCGC 


TATATTGTGG 


CTGCTATTTC 


ATCTGGTTTC TTTTAATGTG AGGAACTCAC 


5360 


ATACTGACTT 

20 

CGGGACTCAG 


CAGTGGG3VCT 


CGGTGAGCCG 


GGGCCGTCTG TGTGGTGGGA 


CCCCCTTTAG 


5420 


TG9UX:TGGGG 


CCGTCTGTGT 


GGT6GAGCCA GGGCCTCTCC 


CTTTAGTGGA 


5480 


6CCAGGTT6T 


CGGGCCCCGA 


ATGTCACTG6 


TGGATCTAAG AAGGGCTGAG 


TGGTCTGACA 


5540 


25CCAAAAavrG 


CCGCAGGGA6 


GGCTGTGGTG 


CCGGTGCTTC CAACAAGGAC AGCCCTCCTT 


5600 


GACCCTGAAA 


GGTVACACTGG 


CTTGAAGGAC 


TGCAGACAGG CTCTGAGGGG 


CACGCCCTCC 


5660 


TCAGCGAGAG 

30 

AGCCGCCGAC 


GCAGCAAGGT 


GGCCACAGTG 


TCACTGGTCA GGTGCTTCTC 


ACCACGGGAA 


5720 


ctgtgactcg 


CTTGAGATGG 


GAAAGCGGCG CCACAGACCC 


CGGGTCTCCT 


5780 


TGGCTGTCTG 


TGGGCCGCCC 


CTGGCCACCT 


TGTCCTGGCT CGCAGGGTGC 


AGGAGCGCCT 


5840 


35CGTTCTCTGG 


GTGGCCGGCT 


TGCTGCTCCG 


GTTTGGGCTG TCTTACCATA ACACCGTCCC 


5900 


AGGGCTCTGC 


AGGCCACTGT 


GAGCGCTGGC 


TCCCTGGGCA GTGCTCCTCC 


GTGTGGACTG 


5960 


TGCCTCAGGC 

40 

GCTGCGCCGG 


CAGGGCTCAC 


CAGCTGGGGT 


CCTGTCCGGA AGGATGGGAT 


CTTTCTGGGA 


6020 


ACAGAGTGGG 


GAGCTCCTAG 


TTTGTGGGGG GAAGCTTTGA 


TATCCATGCC 


6080 


ACGTCCATCC 


ACCCCACCCC 


TTTTCGTCAC 


GAGCACAATG 6TCTTACATT 


GGATTTTTGT 


6140 


45AAAAAAATAA 


AAATAAATGG 


AGACTTTAAC 


TC 




6172 



(2) INFORMATION FOR SEQ ID NO: 4: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 . (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Ala Gly Asn Gly Gly 
60 1 5 10 15 

Gin Thr Cys Met Leu Arg Trp Ser Asn Arg lie Arg Leu Thr Trp Leu 
20 25 30 



t 
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Ser Pbe Thr Leu Phe Val He Leu Val Phe Phe Pro Leu He Ala His 
35 40 45 

Tyr Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg He 
5 50 55 60 

Phe Gly Pro Arg Val Gly Asn Glu Leu Cys Glu Val Lys His Val Leu 
65 70 75 80 

lOAsp Leu Cys Arg He TUrg Glu Ser Val Ser Glu Glu Leu Leu Gin Leu 

85 90 95 

Glu Ala Lys Arg Gin Glu Leu Asn Ser Glu He Ala Lys Leu Asn Leu 
100 105 110 

15 

Lys He Glu Ala Cys Lys Lys Ser He Glu Asn Ala Lys Gin Asp Leu 
115 120 125 

Leu Gin Leu Lys Asn Val He Ser Gin Thr Glu His Ser Tyr Lys Glu 
20 130 135 140 

Leu Met Ala Gin Asn Gin Pro Lys Leu Ser Leu Pro He Arg Leu Leu 
145 150 155 160 

25Pro Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Ala Thr Arg Gly 
165 170 175 

Cys Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser 
180 185 190 

30 

Gly Phe Pro Val Tyr Val Tyr Asp Ser Asp Gin Phe Val Phe Gly Ser 
195 200 205 

Tyr Leu Asp Pro Leu Val Lys Gin Ala Phe Gin Ala Thr Ala Arg Ala 
35 210 215 220 

Asn Val Tyr Val Thr Glu Asn Ala Asp He Ala Cys Leu Tyr Val He 
225 230 235 240 

40Leu Val Gly Glu Met Gin Glu Pro Val Val Leu Arg Pro Ala Glu Leu 
245 250 255 

Glu Lys Gin Leu Tyr Ser Leu Pro His Trp Arg Thr Asp Gly His Asn 
260 265 270 

45 

His Val He He Asn Leu Ser Arg Lys Ser Asp Thr Gin Asn Leu Leu 
275 280 285 

Tyr Asn Val Ser Thr Gly Arg Ala Met Val Ala Gin Ser Thr Phe Tyr 
50 290 295 300 

Thr Val Gin Tyr Arg Pro Gly Phe Asp Leu Val Val Ser Pro Leu Val 
305 310 315 320 

55His Ala Met Ser Glu Pro Asn Phe Met Glu He Pro Pro Gin Val Pro 
325 330 335 

Val Lys Arg Lys Tyr Leu Phe Thr Phe Gin Gly Glu Lys He Glu Ser 
340 345 350 

60 

Leu Arg Ser Ser Leu Gin Glu Ala Arg Ser Phe Glu Glu Glu Met Glu 
355 360 365 
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Gly Asp Pro Pro Ala Asp Tyr Asp Asp Arg lie lie Ala Thr Leu Lys 
370 375 380 

Ala Val Gin Asp Ser Lys Leu Asp Gin Val Leu Val Glu Phe Thr Cys 
5385 390 395 400 

Lys Asn Gin Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly 
405 410 415 

10 Glu Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu 
420 425 430 

lie He Thr Pro Gly Asp Pro Arg Leu Val He Ser Ser Gly Cys Ala 
435 440 445 

15 

Thr Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu 
450 455 460 

Gly Glu Gin Val Gin Leu Pro Tyr Gin Asp Met Leu Gin Trp Asn Glu 
20465 470 475 480 

Ala J\la Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu 
485 490 495 

25Leu Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gin Gly 
500 505 510 

Arg Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser He Phe Asn 
515 520 525 

30 

Thr Val Leu Ala Met He Arg Thr Arg He Gin He Pro Ala Ala Pro 
530 535 540 

He Arg Glu Glu Ala Ala Ala Glii He Pro His Arg Ser Gly Lys Ala 
35545 550 555 560 

Ala Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly 
565 570 575 

40Pro Val Glu Thr Glu Pro Pro Tyr Ala Ser Pro 7U:g Tyr Leu Arg Asn 
580 585 590 

Phe Thr Leu Thr Val Thr Asp Phe Tyr Arg Ser Trp Asn Cys Ala Pro 

595 600 605 

45 

Gly Pro Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro 
610 615 620 

Ser Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro He Gly 
5 0 625 630 635 640 

Gly Gly Ala Gly Gly Ser Gly Lys Glu Phe Gin Ala Ala Leu Gly Gly 
645 650 655 

55Asn Val Pro Arg Glu Gin Phe Thr Val Val Met Leu Thr Tyr Glu Arg 
660 665 670 

Glu Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr 
675 680 685 

60 

Leu Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu 
690 695 700 
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Asp Leu Leu Trp Pro Asp lie Gly Val Pro lie Met Val Val Arg Thr 
705 710 715 720 

Glu Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu He Glu 
5 725 730 735 

Thr Glu Ala He Leu Ser He Asp Asp Asp Ala His Leu Arg His Asp 
740 745 750 

IOgIu He Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg He 
755 . 760 765 

Val Gly Phe Pro Gly Arg Tyr His Ala Trp Asp He Pro His Gin Ser 
770 775 780 

15 

Trp Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr 
785 790 795 800 

Gly Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val 
20 805 810 815 

Met Pro Gin Ala He Arg Asp Met Val Asp Glu Tyr He Asn Cys Glu 
820 825 830 

25Asp He Ala Met Asn Phe Leu Val Ser His He Thr Arg Lys Pro Pro 
835 840 845 

He Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gin 
850 855 860 

30 

Ala Leii Ser His Asp Asp iSer His Phe His Glu Arg His Lys Cys He 
865 870 875 880 

Asn Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gin 
35 885 890 895 

Phe Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys 
900 905 910 

40Thr Lys Cys Phe Lys Phe He 
915 

(2) INFORMATION FOR SEQ ID NO: 5: 

45 {i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: protein 



55 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Cys Gly Glu Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr 

15 10 15 

60 

Phe Ala Leu He He Thr Pro Gly Asp Pro Arg Leu Val He Ser Ser 
20 25 30 
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Gly Cys Ala Thr Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro 
35 40 45 

Val Val Leu Gly Glu Gin Val Gin Leu Pro Tyr Gin Asp Met Leu Gin 
5 50 55 60 

Trp Asn Glu Ala Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val 
65 70 75 80 

10 His Phe Leu Leu Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg 

85 90 95 



15 



Arg Gin Gly Arg Phe Leu Trp Glu Thr Tyr Phe Pro Thr Ala Asp Ser 
100 105 110 

lie Phe Asn Thr Val Leu Ala Met lie Arg Thr Arg lie 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 6: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Arg Cys His Lys His Gin Val Phe Asp Tyr Pro Gin Val Leu Gin Glu 
35 1 5 10 15 

Ala Thr Phe Cys Val Val Leu Arg Gly Ala Arg Leu Gly Gin Ala Val 
20 25 30 

40 Leu Ser Asp Val Leu Gin Ala Gly Cys Val Pro Val Val lie Ala Asp 
35 40 45 



45 



Ser Tyr lie Leu Pro Phe Ser Glu Val Leu Asp Trp Lys TUrg Ala Ser 
50 55 60 

Val Val Val Pro Glu Glu Lys Met Ser Asp Val Tyr Ser lie Leu Gin 
65 70 75 80 



Ser lie Pro Gin Arg Gin lie Glu Glu Met Gin Arg Gin Ala Arg Trp 
50 85 90 95 

Phe Trp Glu Ala Tyr Phe Gin Ser lie Lys Ala lie Ala Leu Ala Thr 
100 105 110 

55 Leu Gin lie lie Asn Asp Arg lie 
115 120 

(2) INFORMATION FOR SEQ ID NO: 7: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEC^JENCE DESCRIPTION: SEQ ID NO: 7: 

10 Arg Cys Asp Arg Asp Asn Thr 61u Tyr Glu Lys Tyr Asp Tyr Arg Glu 
15 10 15 

Met Leu His Asn Ala Thr Phe Cys Leu Val Pro Arg Gly Arg Arg Leu 
20 25 30 

Gly Ser Phe Arg Phe Leu Glu Ala Leu Gin Ala Ala Cys Val Pro Val 
35 40 45 



15 



Met Leu Ser Asn Gly Trp Glu Leu Pro Phe Ser Glu Val lie Asn Trp 
20 50 55 60 

Asn Gin Ala Ala Val lie Gly Asp Glu Arg Leu Leu Leu Gin lie Pro 
65 70 75 80 

25 Ser Thr He Arg Ser He His Gin Asp Lys He Leu Ala Leu Arg Gin 

85 90 95 



30 



Gin Thr Gin Phe Leu Trp Glu Ala Tyr Phe Ser Ser Val Glu Lys He 
100 105 110 

Val Leu Thr Thr Leu Glu He He Gin Asp Arg He 
115 120 



<2) INFORMATION FOR SEQ ID NO: 8: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Arg Cys Glu Gin Asp Pro Gly Pro Gly Gin Thr Gin Arg Gin Glu Thr 
50 1 5 10 15 

Leu Pro Asn Ala Thr Phe Cys Leu He Ser Gly His Arg Pro Glu Ala 
20 25 30 

55 Ala Ser Arg Phe Leu Gin Ala Leu Gin Ala Gly Cys He Pro Val Leu 
35 40 45 



60 



Leu Ser Pro Arg Trp Glu Leu Pro Phe Ser Glu Val He Asp Trp Thr 

50 55 60 

Lys Ala Ala He Val Ala Asp Glu Arg Leu Pro Leu Gin Val Leu Ala 
65 70 75 80 



wo 00/18959 PCT/US99/21654 

20 



7U.a Leu Gin Glu Met Ser Pro Ala Arg Val Leu Ala Leu TVrg Gin Gin 
85 90 95 

Thr Gin Phe Leu Trp Asp Ala Tyr Phe Ser Ser Val Glu Lys Val lie 
5 100 105 110 

His Thr Thr Leu Glu Val He Gin Asp Arg He 
115 120 

10(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: andno acid 

15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



25 Lys Cys Ser Gin Glu Asn Cys Ser Leu Glu Arg Arg Arg Gin Leu He 
15 10 15 



30 



Gly Ser Ser Thr Phe Cys Phe Leu Leu Pro Ser Glu Met Phe Phe Gin 
20 25 30 

Asp Phe Leu Ser Ser Leu Gin Leu Gly Cys He Pro He Leu Leu Ser 
35 40 45 



T^n Ser Gin Leu Leu Pro Phe Gin Asp Leu He Asp Trp Arg Arg Ala 
35 50 55 60 

Thr Tyr Arg Leu Pro Leu Ala Arg Leu Pro Glu Ala His Phe He Val 
65 70 75 80 

40 Gin Ser Phe Glu He Ser Asp He He Glu Met Arg Arg Val Gly Arg 

85 90 95 



45 



Leu Phe Tyr Glu Thr Tyr Leu Ala Asp Arg His Leu Leu Ala Arg Ser 
100 105 110 

Leu Leu Ala Ala Leu Arg Tyr Lys Leu 
115 120 



(2) INFORMATION FOR SEQ ID NO: 10: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



60 

(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Val Pro Arg Glu Gin Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu 
15 10 15 

Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu 
20 25 30 

Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp 
35 40 45 

Leu Leu Trp Pro Asp lie Gly Val Pro lie Met Val Val Arg Thr Glu 
50 55 60 

Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu lie Glu Thr 
65 70 75 80 

Glu Ala lie Leu Ser lie Asp Asp Asp Ala His Leu Arg His Asp Glu 
.85 90 95 

lie Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg T^p Arg lie Val 
100 105 110 

Gly Phe Pro Gly Arg Tyr His Ala Trp Asp lie Pro His Gin Ser Trp 
115 120 125 

Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly 
130 135 140 

Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met 
145 150 155 160 

Pro Gin Ala He Arg Asp Met Val Asp Glu Tyr He Asn Cys Glu Asp 
165 170 175 

He Ala Met Asn Phe Leu Val Ser His He Thr Arg Lys Pro Pro He 
180 185 190 

Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gin Ala 
195 200 205 

Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys He Asn 
210 215 220 

Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gin Phe 
225 230 235 240 

Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr 
245 250 255 

Lys Cys Phe Lys Phe He 
260 

2) INFORMATION FOR SSQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: TiniSar 



(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

Pro Gin Ser Gin Gly Phe Thr Gin lie Val Leu Thr Tyr Asp Arg Val 
15 10 15 

5 

Glu Ser Leu Phe Arg Val lie Thr Glu Val Ser Lys Val Pro Ser Lfeu 
20 25 30 

Ser Lys Leu Leu Val Val Trp Asn Asn Gin Asn Lys Asn Pro Pro Glu 
10 35 40 45 

Asp Ser Leu Trp Pro Lys lie Arg Val Pro Leu Lys Val Val Arg Thr 
50 55 60 

15 Ala Glu Asn Lys Leu Ser Asn Arg Phe Phe Pro Tyr Asp Glu He Glu 
65 70 75 80 

Thr Glu Ala Val Leu Ala He Asp Asp Asp He He Met Leu Thr Ser 
85 90 95 

Asp Glu Leu Gin Phe Gly Tyr Glu Val Trp Arg Glu Phe Pro Asp Arg 
100 105 110 



20 



Leu Val Gly Tyr Pro Gly Arg Leu His Leu Trp Asp His Glu Ala Met 
25 115 120 125 

Asn Lys Trp Lys Tyr Glu Ser Glu Trp Thr Asn Glu Val Ser rtet Val 
130 135 140 

30 Leu Thr Gly Ala Ala Phe Tyr His Lys Tyr Phe Asn Tyr Leu Tyr Thr 

145 150 155 160 

Lys Met Pro Gly Asp He Lys Asn Trp Val Asp Ala His Met Asn Cys 
165 170 175 



35 



50 



Tyr Glu Asp He Ala Met Asn Phe Leu Val Ala Asn Val Thr Gly Lys 
180 185 190 



Ala Val He Lys Val Thr Pro Arg Lys Lys Phe Lys Cys Pro Glu Cys 
40 195 200 205 

Thr Ala He Asp Gly Leu Ser Leu Asp Gin Thr His Met Val Glu Arg 
210 215 220 

45 Ser Glu Cys He Asn Lys Phe Ala Ser Val Phe Gly Thr Met Pro Leu 
225 230 235 240 



Lys Val Val Glu His Arg Ala Asp Pro Val Leu Tyr Lys Asp Asp Phe 
245 250 255 

Pro Glu Lys Leu Lys Ser Phe Pro Asn He Gly Ser Leu 
260 265 



(2) INFORMATION FOR SEQ ID NO: 12: 

55 

(i) SEQUENCE CHARACTEfllSTICS : 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Pro Pro Ser Lys Phe Thr Ala Val lie His Ala Val Thr Pro Leu Val 
15 10 15 

Ser Gin Ser Gin Pro Val Leu Lys Leu Leu Val Ala Ala Ala Lys Ser 

20 25 30 

Gin Tyr Cys Ala Gin lie lie Val Leu Trp Asn Cys Asp Lys Pro Leu 
35 40 45 

Pro Ala Lys His Arg Trp Pro Ala Thr Ala Val Pro Val Val Val He 
50 55 • 60 

Glu Gly Glu Ser Lys Val Met Ser Ser Arg Phe Leu Pro Tyr Asp Asn 
65 70 75 80 

He He Thr Asp Ala Val Leu Ser Leu Asp Glu Asp Thr Val Leu Ser 
85 90 95 

Thr Thr Glu Val Asp Phe Ala Phe Thr Val Trp Gin Ser Phe Pro Glu 
100 105 110 

Arg He Val Gly Tyr Pro Ala Arg Ser His Phe Trp Asp Asn Ser Lys 
115 120 125 

Glu Arg Trp Gly Tyr Thr Ser Lys Trp Thr Asn Asp Tyr Ser Met Val 
130 135 140 

Leu Thr Gly Ala Ala He Tyr His Lys Tyr Tyr His Tyr Leu Tyr Ser 
145 150 155 160 

His Tyr Leu Pro Ala Ser Leu Lys Asn Met Val Asp Gin Leu Ala Asn 
165 170 175 

Cys Glu Asp He Leu Met Asn Phe Leu Val Ser Ala Val Thr Lys Leu 
180 185 190 

Pro Pro He Lys Val Thr Gin Lys Lys Gin Tyr Lys Glu Thr Met Met 
195 200 205 

Gly Gin Thr Ser Arg Ala Ser Arg Trp Ala Asp Pro Asp His Phe Ala 
210 215 220 

Gin Arg Gin Ser Cys Met Asn Thr Phe Ala Ser Trp Phe Gly Tyr Met 
225 230 235 240 

Pro Leu He His Ser Gin Met Arg Leu Asp Pro Val Leu Lys Asp Gin 
245 250 255 

Val Ser He Leu Arg Lys Lys Tyr Arg Asp He Glu Arg Leu 
260 265 270 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



25 



40 



55 



24 

(ii) MOLBCULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Pro Glu Gly Arg Phe Ser Ala Leu lie Trp Val Gly Pro Pro Gly Gin 
1 5 10 15 

Pro Pro Leu Lys Leu He Gin Ala Val Ala Gly Ser Gin His Cys Ala 
20 25 30 



Gin He Leu Val Leu Trp Ser Asn Glu Arg Pro Leu Pro Ser Arg Trp 
15 35 40 45 

Pro Glu Thr Ala Val Pro Leu Thr Val He Asp Gly His Arg Lys Val 
50 55 60 

20 Ser Asp Arg Phe Tyr Pro Tyr Ser Thr He Arg Thr Asp Ala He Leu 
65 70 75 80 



Ser Leu Asp Ala Arg Ser Ser Leu Ser Thr Ser Glu Val Asp Phe Ala 

85 90 - 95 

Phe Leu Val Trp Gin Ser Phe Pro Glu Arg Met Val Gly Phe Leu Thr 
100 105 110 



Ser Ser His Phe Trp Asp Glu Ala His Gly Gly Trp Gly Tyr Thr Ala 
30 115 120 125 

Glu Arg Thr Asn Glu Phe Ser Met Val Leu Thr Thr Ala Ala Phe Tyr 
130 135 140 

35 His Arg Tyr Tyr His Thr Leu Phe Thr His Ser Leu Pro Lys Ala Leu 
145 150 155 160 

Arg Thr Leu Ala Asp Glu Ala Pro Thr Cys Val Asp Val Leu Met Asn 
165 170 175 



Phe He Val Ala Ala Val Thr Lys Leu Pro Pro He Lys Val Pro Tyr 
180 185 190 



Gly Lys Gin Arg Gin Glu Ala Ala Pro Leu Ala Pro Gly Gly Pro Gly 
45 195 200 205 

Pro Arg Pro Lys Pro Pro Ala Pro Ala Pro Asp cys He Asn Gin He 
210 215 220 

50 Ala Ala Ala Phe Gly His Met Pro Leu Leu Ser Ser Arg Leu Arg Leu 
225 230 235 240 



Asp Pro Val Leu Phe Lys Asp Pro Val Ser Val Gin Arg Lys Lys Tyr 
245 250 255 

Arg Ser Leu Glu Lys Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 14: 

60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



5 



10 



25 



40 



55 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

Ser Thr Met Asp Ser Phe Thr Leu lie Met Gin Thr Tyr Asn Arg Thr 
1 5 .10 15 



Asp Leu Leu Leu Lys Leu Leu Asn His Tyr Gin Ala Val Pro Asn Leu 
15 20 25 3d 

His Lys Val lie Val Val Trp Asn Asn lie Gly Glu Lys Ala Pro Asp 
35 40 45 

20 Glu Leu Trp Asn Ser Leu Gly Pro His Pro lie Pro Val He Phe Lys 
50 55 60 



Gin Gin Thr Ala Asn Arg Met Arg Asn Arg Leu Gin Val Phe Pro Glu 
65 70 75 80 

Leu Glu Thr Asn Ala Val Leu Met Val Asp Asp Asp Thr Leu He Ser 
85 90 95 



Thr Pro Asp Leu Val Phe Ala Phe Ser Val Trp Gin Gin Phe Pro Asp 
30 100 105 110 

Gin lie Val Gly Phe Val Pro Arg Lys His Val Ser Thr Ser Ser Gly 
115 120 125 

35 He Tyr Ser Tyr Gly Ser Phe Glu Met Gin 7\la Pro Gly Ser Gly Asn 
130 135 140 



Gly Asp Gin Tyr Ser Met Val Leu He Gly Ala Ser Phe Phe Asn Ser 
145 150 155 160 

Lys Tyr Leu Glu Leu Phe Gin Arg Gin Pro Ala Ala Val His Ala Leu 
165 170 175 



He Asp Asp Thr Gin Asn Cys Asp Asp He Ala Met Asn Phe He He 
45 180 185 190 

Ala Lys His He Gly Lys Thr Ser Gly He Phe Val Lys Pro Val Asn 
195 200 205 

50 Met Asp Asn Leu Glu Lys Glu Thr 7\sn Ser Gly Tyr Ser Gly Met Trp 
210 215 220 



His Arg Ala Glu His Ala Leu Gin Arg Ser Tyr Cys He Asn Lys Leu 

225 230 235 240 

Val Asn He Tyr Asp Ser Met Pro Leu Arg Tyr Ser Asn He Met He 

245 250 255 



Ser Gin Phe Gly Phe Pro Tyr Ala Asn Tyr Lys Arg Lys He 
60 260 265 270 

(2) INFORMATION FOR Sm ID NO: 15: 
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(i) SEQUENCE CHTU^CTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Arg Gin Arg Glu Gin Phe Thr Val Val Leu Leu Thr Tyr Glu Arg Asp 
1 5 . 10 15 

Ala Val Leu Thr Gly Ala Leu Glu Arg Leu His Gin Leu Pro Tyr Leu 
20 25 30 

Asn Lys lie lie Val Val Trp Asn Asn Val Asn Arg Asp Pro Pro Asp 
35 40 45 

Ser Trp Pro Ser Leu His He Pro Val Glu Phe He Arg Val Ala Glu 
50 55 60 

Asn Asn Leu Asn Asn Arg Phe Val Pro Trp Asp Arg He Glu Thr Glu 
65 70 75 80 

Ala Val Leu Ser Leu Asp Asp Asp He Asp Leu Met Gin Gin Glu He 
85 90 95 

He Leu Ala Phe Arg Val Trp Arg Glu Asn Arg Asp Arg He Val Gly 
100 105 110 

Phe Pro Ala Arg His His Ala Arg Tyr Gly Asp Ser Met Phe Tyr Asn 
115 120 125 

Ser Asn His Thr Cys Gin Met Ser Met He Leu Thr Gly Ala Ala Phe 

130 135 140 

He His Lys Asn Tyr Leu Thr Ala Tyr Thr Tyr Glu Met Pro Ala Glu 
145 150 155 160 

He Arg Glu His Val Asn Ser He Lys Asn Cys Glu Asp He Ala Met 
165 170 175 

Asn Tyr Leu Val Ser His Leu Thr Arg Lys Pro Pro He Lys Thr Thr 
180 185 190 

Ser Arg Trp Thr Leu Lys Cys Pro Thr Cys Thr Glu Ser Leu Tyr Lys 
195 200 205 

Glu Gly Thr His Phe Glu Lys Arq His Glu Cys Met Arg Leu Phe Thr 
210 215 220 

Lys He Tyr Gly Tyr Asn Pro Leu Lys Phe Ser Gin Phe Arg Ala Asp 
225 230 235 240 

Ser He Leu Phe Lys Thr Arg Leu Pro Gin Asn His Gin Lys Cys Phe 
245 250 255 



Lys Tyr Val 
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(2) INFORMATION FOR SEQ ID NO: 16: 

i (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TTATGGCGAG TGACCCGACG TG 
20(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

35TTGCTAZ\AGT GAAGGAAGTT GG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ACCCG7U:GTG ATCTGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STR/UQDEDNESS : single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

5AAGAGCTCCT GCAGCTGG 

(2) INFORMATION FOR SEQ ID NO: 20: 

<i) SEQUENCE CHARACTERISTICS: 
10 I A) LENGTH: 18 base pairs 

(B) TYPE:- nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
TTCTCGTTGC CCTCTCAC 
(2) INFORMATION FOR SEQ ID NO:21: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genraoic) 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 



ATCATCAATC TGTCACG 

40 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

55 

ACTACGATGA CCGGATC 
(2) INFORMATION FOR SEQ ID NO: 23: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



5 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

10 Arg Cys Asp Arg Asp Asn Thr Glu Tyr Glu Lys Tyr Asp Tyr Arg Glu 
15 10 15 

Met Leu His Asn Ala Thr Phe Cys Leu Val Pro TVrg Gly Arg Arg Leu 
20 25 30 



15 



Gly Ser Phe Arg Phe Leu Glu Ala Leu Gin Ala Ala Cys Val Pro Val 
35 40 45 



Met Leu Ser Asn Gly Trp Glu Leu Pro Phe Ser Glu Val lie Asn Trp 

20 50 55 60 

Asn Gin Ala Ala Val lie Gly Asp Glu Arg Leu Leu Leu Gin lie Pro 
65 70 75 80 

25 Ser Thr lie Arg Ser lie His Gin Asp Lys lie Leu Ala Leu Arg Gin 

85 90 95 



30 



Gin Thr Gin Phe Leu Trp Glu Ala Tyr Phe Ser Ser Val Glu Lys lie 
100 105 110 

Val Leu Thr Thr Leu Glu lie lie Gin Asp Arg lie 
115 120 



(2) INFORMATION FOR SEQ ID NO: 8: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Arg Cys Glu Gin Asp Pro Gly Pro Gly Gin Thr Gin Arg Gin Glu Thr 
50 1 5 10 15 

Leu Pro Asn Ala Thr Phe Cys Leu lie Ser Gly His Arg Pro Glu Ala 
20 25 30 

55 Ala Ser Arg Phe Leii Gin Ala Leu Gin Ala Gly Cys He Pro Val Leu 
35 40 45 



60 



Leu Ser Pro Arg Trp Glu Leu Pro Phe Ser Glu Val He Asp Trp Thr 
50 55 60 

Lys Ala Ala He Val Ala Asp Glu Arg Leu Pro Leu Gin Val Leu Ala 
65 70 75 80 
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Ala Leu Gin Glu Met Ser Pro Ala Arg Val Leu Ala Leu Arg Gin Gin 
85 90 95 

Thr Gin Phe Leu Trp Asp Ala Tyr Phe Ser Ser Val Glu Lys Val He 
5 100 105 110 

His Thr Thr Leu Glu Val He Gin Asp Arg He 
115 120 

10(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



25 Lys Cys Ser Gin Glu Asn Cys Ser Leu Glu Arg Arg Arg Gin Leu He 

15 10 15 



30 



Gly Ser Ser Thr Phe Cys Phe Leu Leu Pro Ser Glu Met Phe Phe Gin 
20 25 30 

Asp Phe Leu Ser Ser Leu Gin Leu Gly Cys He Pro He Leu Leu Ser 
35 40 45 



Asn Ser Gin Leu Leu Pro Phe Gin Asp Leu He Asp Trp Arg Arg Ala 
35 50 55 60 

Thr Tyr Arg Leu Pro Leu Ala Arg Leu Pro Glu Ala His Phe He Val 
65 70 75 80 

40 Gin Ser Phe Glu He Ser Asp He He Glu Met Arg Arg Val Gly Arg 

85 90 95 



45 



lieu Phe Tyr Glu Thr Tyr Leu Ala Asp Arg His Leu Leu Ala Arg Ser 
100 105 110 

Leu Leu Ala Ala Leu Arg Tyr Lys Leu 
115 120 



(2) INFORMATION FOR SEQ ID NO: 10: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
55 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

60 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Val Pro Arg Glu Gin Phe Thr Val Val Met I*eu Thr Tyr Glu Arg Glu 
15 10 15 

Glu Val Leu Met Asn Ser Leu Glu Arg Leu Hsn Gly Leu Pro Tyr Leu 
20 25 30 

Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp 
35 40 45 

Leu Leu Trp Pro Asp lie Gly Val Pro lie Met Val Val Arg Thr Glu 
50 55 60 

Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu lie Glu Thr 
65 70 75 80 

Glu Ala lie Leu Ser lie Asp Asp Asp Ala His Leu Arg His Asp Glu 
.85 90 95 

He Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg He Val 
100 105 110 

Gly Phe Pro Gly Arg Tyr His Ala Trp Asp He Pro His Gin Ser Trp 
115 120 125 

Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly 
130 135 140 

Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met 
145 150 155 160 

Pro Gin Ala He Arg Asp Met Val Asp Glu Tyr He Asn Cys Glu Asp 
165 170 175 

He Ala Met Asn Phe Leu Val Ser His He Thr Arg Lys Pro Pro He 
180 185 190 

Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gin 7U.a 
195 200 205 

Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys He Asn 
210 215 220 

Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gin Phe 
225 230 235 240 

Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr 
245 250 255 

Lys cys Phe Lys Phe He 
260 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: TinSar 

(ii) MOLECULE TYPE: protein 



wo 00/18959 V PCT/US99/21654 

.i 22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Gin Ser Gin Gly Phe Thr Gin. He Val Leu Thr Tyr Asp Arg Val 
15 10 15 

5 

Glu Ser Leu Phe Arg Val He Thr Glu Val Ser Lys Val Pro Ser Leu 
20 25 30 

Ser Lys Leu Leu Val Val Trp Asn Asn Gin Asn Lys Asn Pro Pro Glu 
10 35 40 45 

Asp Ser Leu Trp Pro Lys He Arg Val Pro Leu Lys Val Val Arg Thr 
50 55 60 

15 Ala Glu Asn Lys Leu Ser Asn JUrg Phe Phe Pro Tyr Asp Glu He Glu 
65 70 75 80 



20 



35 



50 



Thr Glu Ala Val Leu Ala He Asp Asp Asp He He Met Leu Thr Ser 
85 90 95 

Asp Glu Leu Gin Phe Gly Tyr Glu Val Trp Arg Glu Phe Pro Asp Arg 
100 105 110 



Leu Val Gly Tyr Pro Gly Arg Leu His Leu Trp 2\sp His Glu Ala Met 
25 115 120 125 

Asn Lys Trp Lys Tyr Glu Ser Glu Trp Thr Asn Glu Val Ser Met Val 
130 135 140 

30 Leu Thr Gly Ala Ala Phe Tyr His Lys Tyr Phe Asn Tyr Leu Tyr Thr 
145 150 155 160 

Lys Met Pro Gly Asp He Lys Asn Trp Val Asp Ala His Met Asn Cys 
165 170 175 



Tyr Glu Asp He Ala Met Asn Phe Leu Val Ala Asn Val Thr Gly Lys 
180 185 190 



Ala Val He Lys Val Thr Pro Arg Lys Lys Phe Lys Cys Pro Glu Cys 
40 195 200 205 

Thr Ala He Asp Gly Leu Ser Leu Asp Gin Thr His Met Val Glu Arg 
210 215 220 

45 Ser Glu Cys He Asn Lys Phe Ala Ser Val Phe Gly Thr Met Pro Leu 
225 230 235 240 



Lys Val Val Glu His Arg Ala T^p Pro Val Leu Tyr Lys Asp Asp Phe 
245 250 255 

Pro Glu Lys Leu Lys Ser Phe Pro Asn He Gly Ser Leu 
260 265 



(2) INFORMATION FOR SEQ ID NO: 12: 

55 

(i) SEQUENCE CHARACTE]E11STICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Pro Pro Ser Lys Phe Thr Ala Val He His Ala Val Thr Pro Leu Val 
15 10 15 

Ser Gin Ser Gin Pro Val Leu Lys Leu Leu Val Ala Ala Ala Lys Ser 
20 25 30 

Gin Tyr Cys Ala Gin He He Val Leu Trp Asn Cys Asp Lys Pro Leu 
35 40 45 

Pro Ala Lys His Arg Trp Pro Ala Thr Ala Val Pro Val Val Val He 
50 55 . 60 

Glu Gly Glu Ser Lys Val Met Ser Ser Arg Phe Leu Pro Tyr Asp Asn 
65 70 75 80 

He He Thr Asp Ala Val Leu Ser Leu Asp Glu Asp Thr Val Leu Ser 
85 90 95 

Thr Thr Glu Val Asp Phe Ala Phe Thr Val Trp Gin Ser Phe Pro Glu 
100 105 - 110 

Arg He Val Gly Tyr Pro Ala Arg Ser His Phe Trp Asp Asn Ser Lys 
115 120 125 

Glu Arg Trp Gly Tyr Thr Ser Lys Trp Thr Asn Asp Tyr ser Met Val 
130 135 140 

Leu Thr Gly Ala Ala He Tyr His Lys Tyr Tyr His Tyr Leu Tyr Ser 
145 150 155 160 

His Tyr Leu Pro Ala Ser Leu Lys Asn Met Val Asp Gin Leu Ala Asn 
165 170 175 

Cys Glu Asp He Leu Met Asn Phe Leu Val Ser Ala Val Thr Lys Leu 
180 185 190 

Pro Pro He Lys Val Thr Gin Lys Lys Gin Tyr Lys Glu Thr Met Met 
195 200 205 

Gly Gin Thr Ser Arg Ala Ser Arg Trp Ala Asp Pro Asp His Phe Ala 
210 215 220 

Gin Arg Gin Ser Cys Met 7\sn Thr Phe Ala Ser Trp Phe Gly Tyr Met 
225 230 235 240 

Pro Leu He His Ser Gin Met Arg Leu Asp Pro Val Leu Lys Asp Gin 
245 250 255 

Val Ser He Leu Arg Lys Lys Tyr Arg Asp He Glu Arg Leu 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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24 



10 



25 



40 



55 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Pro Glu Gly Arg Phe Ser Ala Leu He Trp Val Gly Pro Pro Gly Gin 
15 10 15 

Pro Pro Leu Lys Leu He Gin Ala Val Ala Gly Ser Gin His Cys Ala 
20 25 30 



Gin He Leu Val Leu Trp Ser Asn Glu Arg Pro Leu Pro Ser Arg Trp 
15 35 40 45 

Pro Glu Thr Ala Val Pro Leu Thr Val He Asp Gly His Arg Lys Val 
50 55 60 

20 Ser Asp Arg Phe Tyr Pro Tyr Ser Thr He Arg Thr Asp Ala He Leu 
65 70 75 80 



Ser Leu Asp Ala Arg Ser Ser Leu Ser Thr Ser Glu Val Asp Phe Ala 

85 90 - 95 

Phe Leu Val Trp Gin Ser Phe Pro Glu Arg Met Val Gly Phe Leu Thr 
100 105 110 



Ser Ser His Phe Trp Asp Glu Ala His Gly Gly Trp Gly Tyr Thr Ala 
30 115 120 125 

Glu Arg Thr Tlsn Glu Phe Ser Met Val Leu Thr Thr Ala Ala Phe Tyr 
130 135 140 

35 His Arg Tyr Tyr His Thr Leu Phe Thr His Ser Leu Pro Lys Ala Leu 
145 150 155 160 



Arg Thr Leu Ala Asp Glu Ala Pro Thr Cys Val Asp Val Leu Met Asn 

165 170 175 

Phe He Val Ala Ala Val Thr Lys Leu Pro Pro He Lys Val Pro Tyr 
180 185 190 



Gly Lys Gin Arg Gin Glu Ala Ala Pro Leu Ala Pro Gly Gly Pro Gly 
45 195 200 205 

Pro Arg Pro Lys Pro Pro JU.a Pro Ala Pro Asp Cys He Asn Gin He 

210 215 220 

50 Ala Ala Ala Phe Gly His Met Pro Leu Leu Ser Ser Arg Leu Arg Leu 
225 230 235 240 



Asp Pro Val Leu Phe Lys Asp Pro Val Ser Val Gin Arg Lys Lys Tyr 
245 250 255 

Arg Ser Leu Glu Lys Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 14: 

60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



5 



10 



25 



40 



55 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Ser Thr Met Asp Ser Phe Thr Leu lie Met Gin Thr Tyr Asn Arg Thr 
1 5 .10 15 

Asp Leu Leu Leu Lys Leu Leu Asn His Tyr Gin Ala Val Pro Asn Leu 
15 20 25 3d 

His Lys Val lie Val Val Trp Asn T^n lie Gly Glu Lys Ala Pro Asp 
35 40 45 

20 Glu Leu Trp Asn Ser Leu Gly Pro His Pro lie Pro Val lie Phe Lys 
50 55 60 



Gin Gin Thr Ala Asn Arg Met Arg Asn Arg Leu Gin Val Phe Pro Glu 
65 70 75 ^ 80 

Leu Glu Thr Asn Ala Val Leu Met Val Asp Asp Asp Thr Leu lie Ser 
85 90 95 



Thr Pro Asp Leu Val Phe Ala Phe Ser Val Trp Gin Gin Phe Pro Asp 
30 100 105 110 

Gin lie Val Gly Phe Val Pro Arg Lys His Val Ser Thr Ser Ser Gly 
115 120 125 

35 lie Tyr Ser Tyr Gly Ser Phe Glu Met Gin Ala Pro Gly Ser Gly Asn 
130 135 140 



Gly Asp Gin Tyr Ser Met Val Leu lie Gly Ala Ser Phe Phe Asn Ser 
145 150 155 160 

Lys Tyr Leu Glu Leu Phe Gin Arg Gin Pro Ala Ala Val His Ala Leu 
165 170 175 



lie Asp Asp Thr Gin Asn Cys Asp Asp He Ala Met Asn Phe He He 
45 180 185 190 

Ala Lys His He Gly Lys Thr Ser Gly He Phe Val Lys Pro Val Asn 

195 200 205 

50 Met Asp Asn Leu Glu Lys Glu Thr Asn Ser Gly Tyr Ser Gly Met Trp 
210 215 220 

His Arg Ala Glu His Ala Leu Gin Arg Ser Tyr Cys He Asn Lys Leu 
225 230 235 240 



Val Asn He Tyr Asp Ser Met Pro Leu Arg Tyr Ser Asn He Met He 
245 250 255 



Ser Gin Phe Gly Phe Pro Tyr Ala Asn Tyr Lys Arg Lys He 
60 260 265 270 

(2) INFORMATION FOR 3BQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

Arg Gin Arg Glu Gin Phe Thr Val Val Leu Leu Thr Tyr Glu Arg Asp 
1 5 . 10 15 

Ala Val Leu Thr Gly Ala Leu Glu Arg Leu His Gin Leu Pro Tyr Leu 
20 25 30 

Asn Lys lie lie Val Val Trp Asn Asn Val Asn Arg Asp Pro Pro Asp 
35 40 45 

Ser Trp Pro Ser Leu His lie Pro Val Glu Phe lie Arg Val Ala Glu 
50 55 60 

Asn Asn Leu Asn Asn Arg Phe Val Pro Trp Asp Arg lie Glu Thr Glu 
65 70 75 80 

Ala Val Leu Ser Leu Asp Asp Asp lie Asp Leu Met Gin Gin Glu lie 
85 90 95 

lie Leu Ala Phe Arg Val Trp Arg Glu Asn Arg Asp Arg lie Val Gly 
100 105 110 

Phe Pro Ala Arg His His Ala Arg Tyr Gly Asp Ser Met Phe Tyr Asn 
115 120 125 

Ser Asn His Thr Cys Gin Met Ser Met He Leu Thr Gly Ala Ala Phe 
130 135 140 

He His Lys Asn Tyr Leu Thr Ala Tyr Thr Tyr Glu Met Pro Ala Glu 
145 150 155 160 

He Arg Glu His Val Asn Ser He Lys Asn Cys Glu Asp He Ala Met 
165 170 175 

Asn Tyr Leu Val Ser His Leu Thr Arg Lys Pro Pro He Lys Thr Thr 
180 185 190 

Ser Arg Trp Thr Leu Lys Cys Pro Thr Cys Thr Glu Ser Leu Tyr Lys 
195 200 205 

Glu Gly Thr His Phe Glu Lys Arg His Glu Cys Met Arg Leu Phe Thr 
210 215 220 

Lys He Tyr Gly Tyr Asn Pro Leu Lys Phe Ser Gin Phe Arg Ala Asp 
225 230 235 240 

Ser He Leu Phe Lys Thr Arg Leu Pro Gin Asn His Gin Lys Cys Phe 
245 250 255 

Lys Tyr Val 
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(2) XKFORMATION FOR SEQ ID NO: 16: 

; (1) SEQUENCE CHTUUVCTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
TTATGGCGAG TGACCCGACG T6 
20(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DMA (genomic) 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

35TTGCTAAAGT GAAGGAAGTT GG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
ACCCGACGTG ATCTGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA. (genomic) 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

IOTTCCCTACCA GGACATGC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AACATGGCTG ACAACG 
(2) INFORMATION FOR SEQ ID NO: 25: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TATTGGTGGT GGAGCTGG 

45 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

60 

AATCCAGCCA TGGTCTCCTT GG 
(2) INFORMATION FOR SEQ ID NO: 27: 
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(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGTCGATGCC ATTATTACCA GC 

15 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

30 

TTCCTTCCTC ATCACAG 
(2) INFORMATION FOR SEQ ID NO: 29: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: DNA (genomic) 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 
AGGTCTGTGT ATGCACTTGT G 
50(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
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AGTCGATGCC ATTATTACCA. GC 
(2) INFORMATION FOR SEQ ID NO: 31: 

S (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TTCAAGGGTG TGGAGAG 
20(2) INFORMTVTION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

35TTGGCTGAAA GCCAACAACC TG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
AACATGCACG CATCCACAGC 
(2) INFORMATION FOR SEQ ID NO: 34: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

5TTGTAACAa\ GCATGTGG 1 8 

(2) INF0RM7VTI0N FOR SEQ ID NO: 35: 

(i) SEQUENCE CH7VRACTERISTICS : 
10 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANPEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

GCTTCTGTCA GTATTAGCTG GG 22 
(2) INFORMATION FOR SEQ ID NO: 36: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TTCCTCCCTC TGCTCATCCT C 21 

40 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) mLECULE TYPE: DNA (igenomic) 

50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

55 

TTCCCACTCT GTCTCTC 
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